Paper Information - Non-Markov Policies to Reduce Sequential Failures in Robot Bin Picking

Non-Markov Policies to Reduce Sequential Failures in Robot Bin Picking

A new generation of automated bin picking systems using deep learning is evolving to support increasing demand for e-commerce. To accommodate a wide variety of products, many automated systems include multiple gripper types and/or tool changers. However, for some objects, sequential grasp failures are common: when a computed grasp fails to lift and remove the object, the bin is often left unchanged; because the sensor input is then unchanged as well, the system retries the same grasp over and over, resulting in a significant reduction in mean successful picks per hour (MPPH). Based on an empirical study of sequential failures, we characterize a class of “sequential failure objects” (SFOs) – objects prone to sequential failures based on a novel taxonomy. We then propose three non-Markov picking policies that incorporate memory of past failures to modify subsequent actions. Simulation experiments on SFO models and the EGAD dataset [34] suggest that the non-Markov policies significantly outperform the Markov policy in terms of the sequential failure rate and MPPH. In physical experiments on 50 heaps of 12 SFOs, the most effective non-Markov policy increased MPPH over the Dex-Net Markov policy by 107%.
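The retry loop described in the abstract can be illustrated with a small toy sketch in Python. It is not one of the paper's three policies, which the abstract does not specify. The grasp names, scores, and the helper functions `markov_policy`, `non_markov_policy`, and `run_attempts` are all hypothetical. The sketch assumes the bin is unchanged after a failed grasp, so the scored candidates stay the same from one attempt to the next.

```python
from typing import Dict, List, Set


def markov_policy(grasp_scores: Dict[str, float]) -> str:
    """Choose the top-scoring grasp using only the current observation."""
    return max(grasp_scores, key=grasp_scores.get)


def non_markov_policy(grasp_scores: Dict[str, float], failed: Set[str]) -> str:
    """Choose the top-scoring grasp that is not remembered as a past failure."""
    candidates = {g: s for g, s in grasp_scores.items() if g not in failed}
    if not candidates:
        # Every candidate has failed before: fall back to the full set.
        candidates = grasp_scores
    return max(candidates, key=candidates.get)


def run_attempts(
    use_memory: bool,
    grasp_scores: Dict[str, float],
    working_grasps: Set[str],
    max_attempts: int = 5,
) -> List[str]:
    """Simulate repeated attempts on a bin that does not change after a failure."""
    failed: Set[str] = set()  # memory of failed grasps (toy assumption)
    attempts: List[str] = []
    for _ in range(max_attempts):
        if use_memory:
            grasp = non_markov_policy(grasp_scores, failed)
        else:
            grasp = markov_policy(grasp_scores)
        attempts.append(grasp)
        if grasp in working_grasps:
            break  # object removed, stop retrying
        failed.add(grasp)
    return attempts


# Hypothetical scores: the top-ranked grasp fails in practice.
scores = {"suction_top": 0.9, "parallel_jaw_side": 0.7}
works = {"parallel_jaw_side"}

markov_attempts = run_attempts(False, scores, works)
memory_attempts = run_attempts(True, scores, works)
```

In this toy setup the memoryless policy spends all five attempts on the same failing grasp, while the policy with failure memory switches to the second candidate on its second attempt.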

Ken Goldberg | Michael Danielczuk | Jeffrey Mahler | Kate Sanders | Ajay Tanwani

[1] Ian Taylor, et al. Robotic pick-and-place of novel objects in clutter with multi-affordance grasping and cross-domain image matching, 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[2] Honglak Lee, et al. Deep learning for detecting robotic grasps, 2013, Int. J. Robotics Res.

[3] Hongmin Wu, et al. A Latent State-Based Multimodal Execution Monitor with Anomaly Detection and Classification for Robot Introspection, 2019, Applied Sciences.

[4] Ken Goldberg, et al. Adversarial Grasp Objects, 2019, 2019 IEEE 15th International Conference on Automation Science and Engineering (CASE).

[5] Scott Niekum, et al. Learning grounded finite-state representations from unstructured demonstrations, 2015, Int. J. Robotics Res.

[6] Oliver Kroemer, et al. Learning Robust Manipulation Strategies with Multimodal State Transition Models and Recovery Heuristics, 2019, 2019 International Conference on Robotics and Automation (ICRA).

[7] Kuan-Ting Yu, et al. Robotic pick-and-place of novel objects in clutter with multi-affordance grasping and cross-domain image matching, 2019, The International Journal of Robotics Research.

[8] Dong Liu, et al. Endowing Robots with Longer-term Autonomy by Recovering from External Disturbances in Manipulation Through Grounded Anomaly Classification and Recovery Policies, 2018, Journal of Intelligent & Robotic Systems.

[9] Ken Goldberg, et al. Segmenting Unknown 3D Objects from Real Depth Images using Mask R-CNN Trained on Synthetic Data, 2018, 2019 International Conference on Robotics and Automation (ICRA).

[10] Adrian Kratky. Bin Picking System, 2019.

[11] Manuel G. Catalano, et al. Grasp It Like a Pro: Grasp of Unknown Objects With Robotic Hands Based on Skilled Human Expertise, 2020, IEEE Robotics and Automation Letters.

[12] Dieter Fox, et al. 6-DOF Grasping for Target-driven Object Manipulation in Clutter, 2019, 2020 IEEE International Conference on Robotics and Automation (ICRA).

[13] François Chaumette, et al. Grasping Unknown Objects by Coupling Deep Reinforcement Learning, Generative Adversarial Networks, and Visual Servoing, 2020, 2020 IEEE International Conference on Robotics and Automation (ICRA).

[14] Matei T. Ciocarlie, et al. Towards Reliable Grasping and Manipulation in Household Environments, 2010, ISER.

[15] Sergey Levine, et al. Deep Reinforcement Learning for Vision-Based Robotic Grasping: A Simulated Comparative Evaluation of Off-Policy Methods, 2018, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[16] Tapomayukh Bhattacharjee, et al. Adaptive Robot-Assisted Feeding: An Online Learning Framework for Acquiring Previously Unseen Food Items, 2019, 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[17] Charles C. Kemp, et al. A Multimodal Anomaly Detector for Robot-Assisted Feeding Using an LSTM-Based Variational Autoencoder, 2017, IEEE Robotics and Automation Letters.

[18] Zhong Shao, et al. Novelty Detection via Network Saliency in Visual-Based Deep Learning, 2019, 2019 49th Annual IEEE/IFIP International Conference on Dependable Systems and Networks Workshops (DSN-W).

[19] Mustafa İlhan Gökler, et al. Design of an automatic tool changer with disc magazine for a CNC horizontal machining center, 1997.

[20] Tapomayukh Bhattacharjee, et al. Learning from failures in robot-assisted feeding: Using online learning to develop manipulation strategies for bite acquisition, 2019, ArXiv.

[21] Jimmy A. Jørgensen, et al. Applying a learning framework for improving success rates in industrial bin picking, 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[22] Kenneth Y. Goldberg, et al. Learning Deep Policies for Robot Bin Picking by Simulating Robust Grasping Sequences, 2017, CoRL.

[23] Luca Bertinetto, et al. Fully-Convolutional Siamese Networks for Object Tracking, 2016, ECCV Workshops.

[24] Sankaran Srinivas. Error recovery in robot systems, 1977.

[25] Jian Sun, et al. Deep Residual Learning for Image Recognition, 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[26] Karol Hausman, et al. Quantile QT-Opt for Risk-Aware Vision-Based Robotic Grasping, 2019, Robotics: Science and Systems.

[27] Ken Goldberg, et al. Learning ambidextrous robot grasping policies, 2019, Science Robotics.

[28] Andras Toth, et al. Experimental evaluation of a novel automatic service robot tool changer, 2011, 2011 IEEE/ASME International Conference on Advanced Intelligent Mechatronics (AIM).

[29] Jeremy A. Marvel, et al. Addressing perception uncertainty induced failure modes in robotic bin-picking, 2016.

[30] Chun Yuan, et al. A Reinforcement Learning Approach to Automatic Error Recovery, 2007, 37th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN'07).

[31] Andrew Zisserman, et al. Very Deep Convolutional Networks for Large-Scale Image Recognition, 2014, ICLR.

[32] Hokeun Kim, et al. A multimodal execution monitor with anomaly classification for robot-assisted feeding, 2017, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[33] Sergey Levine, et al. Learning hand-eye coordination for robotic grasping with deep learning and large-scale data collection, 2016, Int. J. Robotics Res.

[34] Jürgen Leitner, et al. EGAD! An Evolved Grasping Analysis Dataset for Diversity and Reproducibility in Robotic Manipulation, 2020, IEEE Robotics and Automation Letters.

[35] Nigel W. Hardy, et al. Knowledge Based Error Recovery in Industrial Robots, 1983, IJCAI.