Integration of an actor-critic model and generative adversarial networks for a Chinese calligraphy robot

Abstract As a combination of robotic motion planning and Chinese calligraphy culture, robotic calligraphy plays a significant role in the inheritance and education of Chinese calligraphy culture. Most existing calligraphy robots focus on enabling the robots to learn writing through human participation, such as human–robot interactions and manually designed evaluation functions. However, because of the subjectivity of art aesthetics, these existing methods require a large amount of implementation work from human engineers. In addition, the written results cannot be accurately evaluated. To overcome these limitations, in this paper, we propose a robotic calligraphy model that combines a generative adversarial network (GAN) and deep reinforcement learning to enable a calligraphy robot to learn to write Chinese character strokes directly from images captured from Chinese calligraphic textbooks. In our proposed model, to automatically establish an aesthetic evaluation system for Chinese calligraphy, a GAN is first trained to understand and reconstruct stroke images. Then, the discriminator network is independently extracted from the trained GAN and embedded into a variant of the reinforcement learning method, the “actor-critic model”, as a reward function. Thus, a calligraphy robot adopts the improved actor-critic model to learn to write multiple character strokes. The experimental results demonstrate that the proposed model successfully allows a calligraphy robot to write Chinese character strokes based on input stroke images. The performance of our model, compared with the state-of-the-art deep reinforcement learning method, shows the efficacy of the combination approach. In addition, the key technology in this work shows promise as a solution for robotic autonomous assembly.

[1]  Huosheng Hu,et al.  A robot calligraphy system: From simple to complex writing by human gestures , 2017, Eng. Appl. Artif. Intell..

[2]  Miguel Martín,et al.  Possibilistic reward methods for the multi-armed bandit problem , 2018, Neurocomputing.

[3]  Elman Mansimov,et al.  Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation , 2017, NIPS.

[4]  Antonio Bandera,et al.  A Survey of Vision-Based Architectures for Robot Learning by Imitation , 2012, Int. J. Humanoid Robotics.

[5]  Hsien-I Lin,et al.  Visual matching of stroke order in robotic calligraphy , 2015, 2015 International Conference on Advanced Robotics (ICAR).

[6]  Zhe Ma,et al.  Stroke Reasoning for Robotic Chinese Calligraphy Based on Complete Feature Sets , 2017, International Journal of Social Robotics.

[7]  Henrik I. Christensen,et al.  An assembly sequence generation of a product family for robot programming , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[8]  Chih-Min Lin,et al.  Generative Adversarial Nets in Robotic Chinese Calligraphy , 2018, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[9]  Yangsheng Xu,et al.  Robot learns Chinese calligraphy from Demonstrations , 2014, 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[10]  T J SpaanMatthijs,et al.  Traffic flow optimization , 2016 .

[11]  Miguel Martín,et al.  The Multi-Armed Bandit Problem under Delayed Rewards Conditions in Digital Campaign Management , 2019, 2019 6th International Conference on Control, Decision and Information Technologies (CoDIT).

[12]  Weidong Zhang,et al.  ThermalNet: A deep reinforcement learning-based combustion optimization system for coal-fired boiler , 2018, Eng. Appl. Artif. Intell..

[13]  Sylvain Calinon,et al.  Generating Calligraphic Trajectories with Model Predictive Control , 2017, Graphics Interface.

[14]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[15]  Hang Yin,et al.  Synthesizing Robotic Handwriting Motion by Learning from Human Demonstrations , 2016, IJCAI.

[16]  Min Jiang,et al.  Robotic Free Writing of Chinese Characters via Human-Robot Interactions , 2014, Int. J. Humanoid Robotics.

[17]  Zhang Xin-wei 4-DOF Lettering Robot's Trajectory Planning , 2010 .

[18]  Raffaello D'Andrea,et al.  Towards robotic calligraphy , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[19]  Jing Li,et al.  An Integrative Framework of Human Hand Gesture Segmentation for Human–Robot Interaction , 2017, IEEE Systems Journal.

[20]  Min Jiang,et al.  A reduced classifier ensemble approach to human gesture classification for robotic Chinese handwriting , 2014, 2014 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE).

[21]  Chun-Yi Su,et al.  Teleoperated robot writing using EMG signals , 2015, 2015 IEEE International Conference on Information and Automation.

[22]  Huosheng Hu,et al.  Use of Automatic Chinese Character Decomposition and Human Gestures for Chinese Calligraphy Robots , 2019, IEEE Transactions on Human-Machine Systems.

[23]  Lantao Yu,et al.  SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient , 2016, AAAI.

[24]  Honghai Liu,et al.  An Interactive Image Segmentation Method in Hand Gesture Recognition , 2017, Sensors.

[25]  Yangsheng Xu,et al.  A geometric approach to stroke extraction for the Chinese calligraphy robot , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[26]  Rahul B. Warrier,et al.  Iterative Learning From Novice Human Demonstrations for Output Tracking , 2016, IEEE Transactions on Human-Machine Systems.

[27]  Sepp Hochreiter,et al.  GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium , 2017, NIPS.

[28]  Min Jiang,et al.  A developmental approach to robotic pointing via human-robot interaction , 2014, Inf. Sci..

[29]  Carme Torras,et al.  Teaching a Robot the Semantics of Assembly Tasks , 2018, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[30]  Jianhua Tao,et al.  Nonrigid point matching of Chinese characters for robot writing , 2017, 2017 IEEE International Conference on Robotics and Biomimetics (ROBIO).

[31]  Changle Zhou,et al.  A computational evaluation system of Chinese calligraphy via extended possibility-probability distribution method , 2017, 2017 13th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD).

[32]  Matthijs T. J. Spaan,et al.  Traffic flow optimization: A reinforcement learning approach , 2016, Eng. Appl. Artif. Intell..

[33]  Ian D. Walker,et al.  A Gesture Learning Interface for Simulated Robot Path Shaping With a Human Teacher , 2014, IEEE Transactions on Human-Machine Systems.

[34]  Tetsuya Ogata,et al.  Visual motor integration of robot's drawing behavior using recurrent neural network , 2016, Robotics Auton. Syst..

[35]  Chih-Min Lin,et al.  Towards Deep Reinforcement Learning Based Chinese Calligraphy Robot , 2018, 2018 IEEE International Conference on Robotics and Biomimetics (ROBIO).

[36]  Wen Yu,et al.  Human behavior learning for robot in joint space , 2015, Neurocomputing.

[37]  MengChu Zhou,et al.  Teaching a calligraphy robot via a touch screen , 2014, 2014 IEEE International Conference on Automation Science and Engineering (CASE).

[38]  Mengdi Wang,et al.  Evaluation of Chinese Calligraphy by Using DBSC Vectorization and ICP Algorithm , 2016 .

[39]  Veljko Potkonjak Robot Handwriting: Why and How? , 2012 .

[40]  Fenghui Yao,et al.  Modeling of Ancient-style Chinese Character and Its Application to CCC Robot , 2006, 2006 IEEE International Conference on Networking, Sensing and Control.

[41]  Ruifeng Li,et al.  Writing skills transfer from human to robot using stiffness extracted from sEMG , 2015, 2015 IEEE International Conference on Cyber Technology in Automation, Control, and Intelligent Systems (CYBER).

[42]  Philip Bachman,et al.  Deep Reinforcement Learning that Matters , 2017, AAAI.

[43]  Giulio Sandini,et al.  Teaching a humanoid robot to draw ‘Shapes’ , 2011, Auton. Robots.

[44]  Zhe Ma,et al.  Aesthetics Evaluation for Robotic Chinese Calligraphy , 2017, IEEE Transactions on Cognitive and Developmental Systems.