Enhancement of multilayer perceptron model training accuracy through the optimization of hyperparameters: a case study of the quality prediction of injection-molded parts

Injection molding is widely used for the mass production of plastic parts and must meet requirements for efficiency and quality consistency. Machine learning can effectively predict the quality of injection-molded parts. However, the performance of machine learning models largely depends on the accuracy of the training. Hyperparameters such as the activation function, momentum, and learning rate are crucial to the accuracy and efficiency of model training. This research further analyzed the influence of hyperparameters on testing accuracy, explored the corresponding optimal learning rate, and provided the optimal training model for predicting the quality of injection-molded parts. In this study, stochastic gradient descent (SGD) and SGD with momentum were used to optimize the artificial neural network model. Optimizing these training hyperparameters improved the testing accuracy for the width of the injection-molded product. The experimental results indicated that, in the absence of momentum, all five activation functions achieved a training accuracy of more than 90% at a learning rate of 0.1. Moreover, when the model was optimized with SGD, with a learning rate of 0.1 for the Sigmoid activation function, the testing accuracy reached 95.8%. Although momentum had the least influence on accuracy, it affected the convergence speed of the Sigmoid function, reducing the number of required learning iterations by 82.4%. Optimizing hyperparameter settings can improve the accuracy of model testing and markedly reduce training time.
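
To make the role of the momentum term concrete, the following is a minimal sketch of the classical SGD-with-momentum update, not the authors' training code. It assumes a toy one-dimensional quadratic loss in place of the multilayer perceptron, and the function names (`sgd_step`, `iterations_to_converge`), the tolerance, and the hyperparameter values are hypothetical choices for illustration only. It does not attempt to reproduce the accuracy or iteration figures reported in the abstract.

```python
def sgd_step(w, grad, velocity, lr, momentum=0.0):
    """One classical (heavy-ball) momentum update.

    velocity <- momentum * velocity - lr * grad
    w        <- w + velocity
    With momentum = 0.0 this reduces to plain SGD: w <- w - lr * grad.
    """
    velocity = momentum * velocity - lr * grad
    return w + velocity, velocity


def iterations_to_converge(lr, momentum, tol=1e-6, max_iter=10_000):
    """Count updates needed to bring |w| below tol.

    Assumption: the loss is the toy function f(w) = 0.5 * w**2, whose
    gradient is simply w. A real MLP loss is noisy and non-convex.
    """
    w, velocity = 1.0, 0.0
    for i in range(1, max_iter + 1):
        w, velocity = sgd_step(w, grad=w, velocity=velocity,
                               lr=lr, momentum=momentum)
        if abs(w) < tol:
            return i
    return max_iter


# Compare plain SGD with momentum SGD at the same (hypothetical) learning rate.
plain = iterations_to_converge(lr=0.01, momentum=0.0)
with_momentum = iterations_to_converge(lr=0.01, momentum=0.9)
print(f"plain SGD: {plain} iterations, with momentum: {with_momentum} iterations")
```

On this toy problem the momentum run needs fewer updates than the plain run because the velocity term accumulates consistent gradient directions. Whether and how much momentum helps on a real network depends on the learning rate, activation function, and data, which is the dependence the study examines.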

Kun-Cheng Ke | Ming-Shyan Huang