Reinforcement Learning Control of a Forestry Crane Manipulator

Forestry machines are heavy vehicles performing complex manipulation tasks in unstructured production forest environments. Together with the complex dynamics of the onboard hydraulically actuated cranes, the rough forest terrains have posed a particular challenge in forestry automation. In this study, the feasibility of applying reinforcement learning control to forestry crane manipulators is investigated in a simulated environment. Our results show that it is possible to learn successful actuator-space control policies for energy efficient log grasping by invoking a simple curriculum in a deep reinforcement learning setup. Given the pose of the selected logs, our best control policy reaches a grasping success rate of 97%. Including an energy-optimization goal in the reward function, the energy consumption is significantly reduced compared to control policies learned without incentive for energy optimization, while the increase in cycle time is marginal. The energy-optimization effects can be observed in the overall smoother motion and acceleration profiles during crane manipulation.

[1]  Z. Malkovský,et al.  Influence of human factor on the time of work stages of harvesters and crane-equipped forwarders , 2018 .

[2]  Olle Gelin,et al.  Forwarder crane’s boom tip control system and beginner-level operators , 2017 .

[3]  Sergey Levine,et al.  How to train your robot with deep reinforcement learning: lessons we have learned , 2021, Int. J. Robotics Res..

[4]  Leonid B. Freidovich,et al.  Path-Constrained Motion Analysis: An Algorithm to Understand Human Performance on Hydraulic Manipulators , 2015, IEEE Transactions on Human-Machine Systems.

[5]  P. Alam ‘Z’ , 2021, Composites Engineering: An A–Z Guide.

[6]  Joanne C. White,et al.  Evaluating the capacity of single photon lidar for terrain characterization under a range of forest conditions , 2021, Remote Sensing of Environment.

[7]  Tsuyoshi Murata,et al.  {m , 1934, ACML.

[8]  Wolfram Burgard,et al.  The limits and potentials of deep learning for robotics , 2018, Int. J. Robotics Res..

[9]  Alec Radford,et al.  Proximal Policy Optimization Algorithms , 2017, ArXiv.

[10]  Sergey Levine,et al.  QT-Opt: Scalable Deep Reinforcement Learning for Vision-Based Robotic Manipulation , 2018, CoRL.

[11]  Leonid B. Freidovich,et al.  Increasing the Level of Automation in the Forestry Logging Process with Crane Trajectory Planning and Control , 2014, J. Field Robotics.

[12]  Tomi Westerlund,et al.  Navigation and Mapping in Forest Environment Using Sparse Point Clouds , 2020, Remote. Sens..

[13]  Marwan Mattar,et al.  Unity: A General Platform for Intelligent Agents , 2018, ArXiv.

[14]  Jiayu Zhou,et al.  Transfer Learning in Deep Reinforcement Learning: A Survey , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Matthew E. Taylor,et al.  Curriculum Learning for Reinforcement Learning Domains: A Framework and Survey , 2020, J. Mach. Learn. Res..

[16]  Anders Hansson,et al.  Semi-autonomous shared control of large-scale manipulator arms , 2010 .

[17]  Pedro La Hera,et al.  What Do We Observe When We Equip a Forestry Crane with Motion Sensors , 2019 .

[18]  Simon Westerberg,et al.  Semi-Automating Forestry Machines Motion Planning, System Integration, and Human-Machine Interaction , 2014 .

[19]  Sukhan Lee,et al.  3D log recognition and pose estimation for robotic forestry machine , 2011, 2011 IEEE International Conference on Robotics and Automation.

[20]  Olle Gelin,et al.  Improved operator comfort and off-road capability through pendulum arm technology , 2020 .