Utilization of a reinforcement learning algorithm for the accurate alignment of a robotic arm in a complete soft fabric shoe tongues automation process

Abstract As usher in Industry 4.0, there has been much interest in the development and research that combine artificial intelligence with automation. The control and operation of equipment in a traditional automated shoemaking production line require a preliminary subjective judgment of relevant manufacturing processes, to determine the exact procedure and corresponding control settings. However, with the manual control setting, it is difficult to achieve an accurate quality assessment of an automated process characterized by high uncertainty and intricacy. It is challenging to replace handicrafts and the versatility of manual product customization with automation techniques. Hence, the current study has developed an automatic production line with a cyber-physical system artificial intelligence (CPS-AI) architecture for the complete manufacturing of soft fabric shoe tongues. The Deep-Q reinforcement learning (RL) method is proposed as a means of achieving better control over the manufacturing process, while the convolutional and long short-term memory artificial neural network (CNN + LSTM) is developed to enhance action speed. This technology allows a robotic arm to learn the specific image feature points of a shoe tongue through repeated training to improve its manufacturing accuracy. For validation, different parameters of the network architecture are tested, and the test convergence accuracy was found to be as high as 95.9 %. During its actual implementation, the production line completed 509 finished products, of which 349 products were acceptable due to the anticipated measurement error. This showed that the production line system was capable of achieving optimum product accuracy and quality with respect to the performance of repeated computations, parameter updates, and action evaluations.

[1]  Jun Hong,et al.  Repetitive assembly action recognition based on object detection and pose estimation , 2020 .

[2]  Muhammed Selman Eryilmaz,et al.  Southeast Europe Journal of Soft Computing , 2012 .

[3]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[4]  Robert B. Fisher,et al.  Best Viewpoint Tracking for Camera Mounted on Robotic Arm with Dynamic Obstacles , 2017, 2017 International Conference on 3D Vision (3DV).

[5]  Abhijit Gosavi,et al.  Simulation-Based Optimization: Parametric Optimization Techniques and Reinforcement Learning , 2003 .

[6]  John G. Breslin,et al.  Big data and stream processing platforms for Industry 4.0 requirements mapping for a predictive maintenance use case , 2020 .

[7]  Dirk P. Kroese,et al.  Simulation and the Monte Carlo method , 1981, Wiley series in probability and mathematical statistics.

[8]  Subramaniam Parasuraman,et al.  Mobile robot navigation: neural Q-learning , 2012, Int. J. Comput. Appl. Technol..

[9]  Paul M. Bosscher,et al.  Real-time collision avoidance algorithm for robotic manipulators , 2009 .

[10]  Li Lin,et al.  Remaining useful life estimation of engineered systems using vanilla LSTM neural networks , 2018, Neurocomputing.

[11]  Chun-Wei Yang,et al.  Applications of artificial intelligence in intelligent manufacturing: a review , 2017, Frontiers of Information Technology & Electronic Engineering.

[12]  Zhenyu Liu,et al.  Petri-net-based dynamic scheduling of flexible manufacturing system via deep reinforcement learning with graph convolutional network , 2020 .

[13]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[14]  Jay Lee,et al.  A Cyber-Physical Systems architecture for Industry 4.0-based manufacturing systems , 2015 .

[15]  Sergey Levine,et al.  End-to-End Training of Deep Visuomotor Policies , 2015, J. Mach. Learn. Res..

[16]  Dazhong Wu,et al.  Deep learning for smart manufacturing: Methods and applications , 2018, Journal of Manufacturing Systems.

[17]  Takehisa Yairi,et al.  A review on the application of deep learning in system health management , 2018, Mechanical Systems and Signal Processing.

[18]  John E. Laird,et al.  Learning procedural knowledge through observation , 2001, K-CAP '01.

[19]  N. Jazdi,et al.  Cyber physical systems in the context of Industry 4.0 , 2014, 2014 IEEE International Conference on Automation, Quality and Testing, Robotics.

[20]  Andrew G. Barto,et al.  Adaptive linear quadratic control using policy iteration , 1994, Proceedings of 1994 American Control Conference - ACC '94.

[21]  Yuan Yu,et al.  TensorFlow: A system for large-scale machine learning , 2016, OSDI.

[22]  Jorge Pinho de Sousa,et al.  An Innovative Framework for the Simulation of Manufacturing Systems: An Application to the Footwear Industry , 2013 .

[23]  Sung Wook Baik,et al.  Action Recognition in Video Sequences using Deep Bi-Directional LSTM With CNN Features , 2018, IEEE Access.

[24]  Guanghua Xu,et al.  Health indicator construction of machinery based on end-to-end trainable convolution recurrent neural networks , 2020 .

[25]  Thomas G. Dietterich Adaptive computation and machine learning , 1998 .

[26]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[27]  Jingyuan Zhang,et al.  Application of Artificial Neural Network Based on Q-learning for Mobile Robot Path Planning , 2006, 2006 IEEE International Conference on Information Acquisition.

[28]  Jack Hollingum Quick‐change automation for shoe manufacture , 1996 .

[29]  Ze-Hao Lai,et al.  Smart augmented reality instructional system for mechanical assembly towards worker-centered intelligent manufacturing , 2020 .

[30]  Alie El-Din Mady,et al.  Validation of Industrial Cyber-Physical Systems: An Application to HVAC Systems , 2016, CSDM.

[31]  F.L. Lewis,et al.  Reinforcement learning and adaptive dynamic programming for feedback control , 2009, IEEE Circuits and Systems Magazine.

[32]  Jyh-Horng Chou,et al.  Data-Driven Approach to Using Uniform Experimental Design to Optimize System Compensation Parameters for an Auto-Alignment Machine , 2018, IEEE Access.

[33]  Ronay Ak,et al.  A Survey of the Advancing Use and Development of Machine Learning in Smart Manufacturing. , 2018, Journal of manufacturing systems.

[34]  Ali Farhadi,et al.  YOLOv3: An Incremental Improvement , 2018, ArXiv.

[35]  Lihui Wang,et al.  Remote human–robot collaboration: A cyber–physical system application for hazard manufacturing environment , 2020 .

[36]  Yunhe Pan,et al.  Heading toward Artificial Intelligence 2.0 , 2016 .

[37]  Yun Zhang,et al.  Attention guided neural network models for occluded pedestrian detection , 2020, Pattern Recognit. Lett..

[38]  Jyh-Horng Chou,et al.  Optimized Positional Compensation Parameters for Exposure Machine for Flexible Printed Circuit Board , 2015, IEEE Transactions on Industrial Informatics.