论文信息 - From Rocks to Walls: a Model-free Reinforcement Learning Approach to Dry Stacking with Irregular Rocks

From Rocks to Walls: a Model-free Reinforcement Learning Approach to Dry Stacking with Irregular Rocks

In-situ resource utilization (ISRU) is a key aspect for an efficient human exploration of extraterrestrial environments. A cost-effective method for the construction of preliminary structures is dry stacking with locally found unprocessed rocks. This work focus on learning this task from scratch. Former approaches rely on previously acquired models of rocks, which may be hard to obtain in the context of a mission. In alternative, we propose a model-free, data driven approach. We formulate the problem as the task of selecting the position to place each rock on top of the currently built structure. The rocks are presented to the robot in sequence. The goal is to assemble a wall that approximates a target volume, given the 3D perception of the currently built structure, the next object and the target volume. An agent is developed to learn this task using reinforcement learning. The deep Q-networks (DQN) algorithm is used, where the Q-network outputs a value map corresponding to the expected return of placing the object in each position of a top-view depth image. The learned policy outperforms engineered heuristics, both in terms of stability of the structure and similarity with the target volume. Despite the simplification of the task, the policy learned with this approach could be applied to a realistic setting as the high level planner in an autonomous construction pipeline.

Alexandre Bernardino | Pedro Vicente | Rodrigo Ventura | André Menezes

[1] Wojciech Zaremba,et al. Domain randomization for transferring deep neural networks from simulation to the real world , 2017, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[2] Alexandre Bernardino,et al. Where is my hand? Deep hand segmentation for visual self-recognition in humanoid robots , 2021, Robotics Auton. Syst..

[3] Roland Siegwart,et al. Autonomous robotic stone stacking with online next best object target pose planning , 2017, ICRA 2017.

[4] Kurt Sacksteder,et al. In-Situ Resource Utilization for Lunar and Mars Exploration , 2007 .

[5] Paul J. Kennedy,et al. Using Artificial Intelligence to Build with Unprocessed Rock , 2012 .

[6] Yifang Liu,et al. Deep Q-Learning for Dry Stacking Irregular Objects , 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[7] Nils Napp,et al. Planning for Robotic Dry Stacking with Irregular Stones , 2019 .

[8] Tom Schaul,et al. Dueling Network Architectures for Deep Reinforcement Learning , 2015, ICML.

[9] Alexandre Bernardino,et al. Towards markerless visual servoing of grasping tasks for humanoid robots , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[10] Thomas Brox,et al. U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[11] Luca Bertinetto,et al. Fully-Convolutional Siamese Networks for Object Tracking , 2016, ECCV Workshops.

[12] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[13] Yifang Liu,et al. Dry Stacking for Automated Construction with Irregular Objects , 2018, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[14] Stig Anton Nielsen,et al. Fusing design and construction as speculative articulations for the built environment , 2015 .

[15] Michele Perchonok,et al. Guidelines and Capabilities for Designing Human Missions , 2003 .

[16] Tom Schaul,et al. Universal Value Function Approximators , 2015, ICML.

[17] Lisa M. Guidone. Living on the moon. , 1969, British medical journal.

[18] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.

[19] Yifang Liu,et al. Approximate Stability Analysis for Drystacked Structures , 2019, 2019 International Conference on Robotics and Automation (ICRA).