Dealing with missing usage data in defect prediction: A case study of a welding supplier

Abstract End-of-line (EoL) testing is performed to determine product quality by ensuring reliable performance. Even though low-quality products may pass EoL testing, they have a high probability of failure over time. Analyzing product usage data can help to improve EoL testing in this regard. However, current approaches do not consider usage data for this purpose. The major challenge for manufacturers is that they do not have access to comprehensive usage data for their products because customers are unwilling to provide usage data. However, manufacturers obtain some usage data from their sales and service departments i.e., contextual data. In this paper, we introduce an alternative approach to improving EoL testing when usage data from customers are missing. We discuss whether it is possible to predict low-quality products from EoL testing data when only contextual information is available (i.e., historical service data and location data of shipped products). We find that a simple, duration-based product usage threshold is sufficient to separate products affected by the production process (low-quality products) from those affected primarily by usage and environmental factors (long-term influence). Low-quality products could only be predicted by combining EoL data and contextual data. Additionally, we identify frequent patterns of maintained components to tackle the challenge of having limited data and promote user acceptance of our predictive model. Finally, we demonstrate our approach by conducting a case study in the welding industry. Our approach can identify frequent component failures and improve product reliability in countries with varying environmental conditions and rates of product usage. We expect that our findings will improve EoL testing protocols in welding and other industries while improving defect prediction models in general.

[1]  Lei Ren,et al.  Bearing remaining useful life prediction based on deep autoencoder and deep neural networks , 2018, Journal of Manufacturing Systems.

[2]  J. Manyika Big data: The next frontier for innovation, competition, and productivity , 2011 .

[3]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[4]  Andres Bustillo,et al.  Smart optimization of a friction-drilling process based on boosting ensembles , 2018, Journal of Manufacturing Systems.

[5]  Ana L. C. Bazzan,et al.  Balancing Training Data for Automated Annotation of Keywords: a Case Study , 2003, WOB.

[6]  Roman Kern,et al.  Ein Industrie 4.0-Use Case in der Motorenproduktion , 2018, Mensch & Computer Workshopband.

[7]  Derya Birant,et al.  Machine learning and data mining in manufacturing , 2021, Expert Syst. Appl..

[8]  Alberto Gómez,et al.  Learning-based scheduling of flexible manufacturing systems using ensemble methods , 2018, Comput. Ind. Eng..

[9]  Christian Endisch,et al.  End-of-line fault detection for combustion engines using one-class classification , 2016, 2016 IEEE International Conference on Advanced Intelligent Mechatronics (AIM).

[10]  Araceli Sanchis,et al.  Fault Detection and Identification Methodology Under an Incremental Learning Framework Applied to Industrial Machinery , 2018, IEEE Access.

[11]  Paula Diehr,et al.  Imputation of missing longitudinal data: a comparison of methods. , 2003, Journal of clinical epidemiology.

[12]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1997, EuroCOLT.

[13]  Agnieszka Kujawińska,et al.  Application of Cluster Analysis in Making Decision About Purchase of Additional Materials for Welding Process , 2018 .

[14]  Bernhard Mitschang,et al.  Analytical Approach to Support Fault Diagnosis and Quality Control in End-Of-Line Testing , 2018 .

[15]  Stefan Thalmann,et al.  Taking Complexity into Account: A Structured Literature Review on Multi-component Systems in the Context of Predictive Maintenance , 2019, EMCIS.

[16]  Giuseppe De Nicolao,et al.  ACID: Automatic Sort-Map Classification for Interactive Process Diagnosis , 2007, IEEE Design & Test of Computers.

[17]  Chang Ouk Kim,et al.  A Convolutional Neural Network for Fault Classification and Diagnosis in Semiconductor Manufacturing Processes , 2017, IEEE Transactions on Semiconductor Manufacturing.

[18]  Okyay Kaynak,et al.  Big Data for Modern Industry: Challenges and Trends [Point of View] , 2015, Proc. IEEE.

[19]  Stefan Thalmann,et al.  Reconciling digital transformation and knowledge protection: a research agenda , 2018 .

[20]  Dimitris Kiritsis,et al.  Product lifecycle management – from its history to its new role , 2010 .

[21]  Lihui Wang,et al.  Predictive Maintenance : Literature Review and Future Trends , 2015 .

[22]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[23]  V. Pammer-Schindler,et al.  An investigation of knowledge protection practices in inter-organisational collaboration: protecting specialised engineering knowledge with a practice based on grey-box modelling , 2020 .

[24]  Pia Honold Culture and Context: An Empirical Study for the Development of a Framework for the Elicitation of Cultural Influence in Product Usage , 2000 .

[25]  Bernhard Mitschang,et al.  Data-Driven Fault Diagnosis in End-of-Line Testing of Complex Products , 2019, 2019 IEEE International Conference on Data Science and Advanced Analytics (DSAA).

[26]  Edward Curry,et al.  The Big Data Value Chain: Definitions, Concepts, and Theoretical Approaches , 2016, New Horizons for a Data-Driven Economy.

[27]  László Monostori,et al.  Lead time prediction using machine learning algorithms: A case study by a semiconductor manufacturer , 2018 .

[28]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[29]  Alexandra Derntl,et al.  Establishing a User-Centered Design Process for Human-Machine Interfaces: Threats to Success , 2019, PROFES.

[30]  Stefan Thalmann,et al.  How Large Manufacturing Firms Understand the Impact of Digitization: A Learning Perspective , 2020, HICSS.

[31]  Christian Endisch,et al.  Robust stochastic process models and parameter estimation for industrial end-of-line-testing , 2018, 2018 IEEE International Conference on Industrial Technology (ICIT).

[32]  Stefan Thalmann,et al.  Cognitive Decision Support for Industrial Product Life Cycles: A Position Paper , 2019 .

[33]  Jae-Yoon Jung,et al.  Imbalanced classification of manufacturing quality conditions using cost-sensitive decision tree ensembles , 2017, Int. J. Comput. Integr. Manuf..

[34]  David W. Coit,et al.  Reliability analysis and condition-based maintenance of systems with dependent degrading components based on thermodynamic physics-of-failure , 2015, The International Journal of Advanced Manufacturing Technology.

[35]  Jongtae Rhee,et al.  Performance Analysis of IoT-Based Sensor, Big Data Processing, and Machine Learning Model for Real-Time Monitoring System in Automotive Manufacturing , 2018, Sensors.

[36]  Manabu Kano,et al.  Prediction and causal analysis of defects in steel products: Handling nonnegative and highly overdispersed count data , 2020 .

[37]  Lakshman S. Thakur,et al.  A big data MapReduce framework for fault diagnosis in cloud-based manufacturing , 2016 .

[38]  John Van Ness,et al.  The Use of Shrinkage Estimators in Linear Discriminant Analysis , 1982, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[39]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[40]  Tie-Yan Liu,et al.  LightGBM: A Highly Efficient Gradient Boosting Decision Tree , 2017, NIPS.

[41]  AgrawalRakesh,et al.  Mining association rules between sets of items in large databases , 1993 .

[42]  D. Bähre,et al.  Quality Prediction of Drilled and Reamed Bores Based on Torque Measurements and the Machine Learning Method of Random Forest , 2020, Procedia Manufacturing.

[43]  Sang Do Noh,et al.  Implementation of Cyber-Physical Production Systems for Quality Prediction and Operation Control in Metal Casting , 2018, Sensors.

[44]  Robert X. Gao,et al.  Deep learning and its applications to machine health monitoring , 2019, Mechanical Systems and Signal Processing.

[45]  Tsukasa Ike,et al.  A CNN-Based Transfer Learning Method for Defect Classification in Semiconductor Manufacturing , 2018, IEEE Transactions on Semiconductor Manufacturing.

[46]  Yu-Jung Huang,et al.  Machine-Learning Approach in Detection and Classification for Defects in TSV-Based 3-D IC , 2018, IEEE Transactions on Components, Packaging and Manufacturing Technology.

[47]  Kesheng Wang,et al.  Towards zero-defect manufacturing (ZDM)—a data mining approach , 2013, Advances in Manufacturing.

[48]  Yu-Sheng Lin,et al.  Developing a decision support system (DSS) for a dental manufacturing production line based on data mining , 2018 .

[49]  Vladislav Maxim,et al.  Embedded vision equipment of industrial robot for inline detection of product errors by clustering–classification algorithms , 2016 .

[50]  Yisha Xiang,et al.  A review on condition-based maintenance optimization models for stochastically deteriorating system , 2017, Reliab. Eng. Syst. Saf..

[51]  Robert X. Gao,et al.  A Deep Learning Approach for Fault Diagnosis of Induction Motors in Manufacturing , 2017, Chinese Journal of Mechanical Engineering.

[52]  J. Friedman Stochastic gradient boosting , 2002 .

[53]  Dennis L. Wilson,et al.  Asymptotic Properties of Nearest Neighbor Rules Using Edited Data , 1972, IEEE Trans. Syst. Man Cybern..

[54]  Jianyu Long,et al.  Manufacturing Quality Prediction Using Intelligent Learning Approaches: A Comparative Study , 2017 .

[55]  Taghi M. Khoshgoftaar,et al.  A survey on addressing high-class imbalance in big data , 2018, Journal of Big Data.

[56]  Amos H. C. Ng,et al.  Towards data mining based decision support in manufacturing maintenance , 2018 .

[57]  Daming Lin,et al.  A review on machinery diagnostics and prognostics implementing condition-based maintenance , 2006 .

[58]  Antoine Grall,et al.  Joint predictive maintenance and inventory strategy for multi-component systems using Birnbaum's structural importance , 2017, Reliab. Eng. Syst. Saf..

[59]  Damjan Maletič,et al.  The impact of quality management orientation on maintenance performance , 2014 .

[60]  P. Fettke,et al.  Industry 4.0 , 2014, Bus. Inf. Syst. Eng..

[61]  Sukhomay Pal,et al.  Torque based defect detection and weld quality modelling in friction stir welding process , 2017 .

[62]  Carl E. Rasmussen,et al.  Occam's Razor , 2000, NIPS.

[63]  Feng Xiang,et al.  Modeling and recognition of steel-plate surface defects based on a new backward boosting algorithm , 2018 .

[64]  Yongdai Kim,et al.  Failure Prediction Using Sequential Pattern Mining in the Wire Bonding Process , 2017, IEEE Transactions on Semiconductor Manufacturing.