Non-Markov Policies to Reduce Sequential Failures in Robot Bin Picking

A new generation of automated bin picking systems using deep learning is evolving to support increasing demand for e-commerce. To accommodate a wide variety of products, many automated systems include multiple gripper types and/or tool changers. However, for some objects, sequential grasp failures are common: when a computed grasp fails to lift and remove the object, the bin is often left unchanged; as the sensor input is consistent, the system retries the same grasp over and over, resulting in a significant reduction in mean successful picks per hour (MPPH). Based on an empirical study of sequential failures, we characterize a class of “sequential failure objects” (SFOs) – objects prone to sequential failures based on a novel taxonomy. We then propose three non-Markov picking policies that incorporate memory of past failures to modify subsequent actions. Simulation experiments on SFO models and the EGAD dataset [19] suggest that the non-Markov policies significantly outperform the Markov policy in terms of the sequential failure rate and MPPH. In physical experiments on 50 heaps of 12 SFOs the most effective Non-Markov policy increased MPPH over the Dex-Net Markov policy by 107%.

[1]  Ian Taylor,et al.  Robotic pick-and-place of novel objects in clutter with multi-affordance grasping and cross-domain image matching , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[2]  Honglak Lee,et al.  Deep learning for detecting robotic grasps , 2013, Int. J. Robotics Res..

[3]  Hongmin Wu,et al.  A Latent State-Based Multimodal Execution Monitor with Anomaly Detection and Classification for Robot Introspection , 2019, Applied Sciences.

[4]  Ken Goldberg,et al.  Adversarial Grasp Objects , 2019, 2019 IEEE 15th International Conference on Automation Science and Engineering (CASE).

[5]  Scott Niekum,et al.  Learning grounded finite-state representations from unstructured demonstrations , 2015, Int. J. Robotics Res..

[6]  Oliver Kroemer,et al.  Learning Robust Manipulation Strategies with Multimodal State Transition Models and Recovery Heuristics , 2019, 2019 International Conference on Robotics and Automation (ICRA).

[7]  Kuan-Ting Yu,et al.  Robotic pick-and-place of novel objects in clutter with multi-affordance grasping and cross-domain image matching , 2019, The International Journal of Robotics Research.

[8]  Dong Liu,et al.  Endowing Robots with Longer-term Autonomy by Recovering from External Disturbances in Manipulation Through Grounded Anomaly Classification and Recovery Policies , 2018, Journal of Intelligent & Robotic Systems.

[9]  Ken Goldberg,et al.  Segmenting Unknown 3D Objects from Real Depth Images using Mask R-CNN Trained on Synthetic Data , 2018, 2019 International Conference on Robotics and Automation (ICRA).

[10]  Adrian Kratky Bin Picking System , 2019 .

[11]  Manuel G. Catalano,et al.  Grasp It Like a Pro: Grasp of Unknown Objects With Robotic Hands Based on Skilled Human Expertise , 2020, IEEE Robotics and Automation Letters.

[12]  Dieter Fox,et al.  6-DOF Grasping for Target-driven Object Manipulation in Clutter , 2019, 2020 IEEE International Conference on Robotics and Automation (ICRA).

[13]  François Chaumette,et al.  Grasping Unknown Objects by Coupling Deep Reinforcement Learning, Generative Adversarial Networks, and Visual Servoing , 2020, 2020 IEEE International Conference on Robotics and Automation (ICRA).

[14]  Matei T. Ciocarlie,et al.  Towards Reliable Grasping and Manipulation in Household Environments , 2010, ISER.

[15]  Sergey Levine,et al.  Deep Reinforcement Learning for Vision-Based Robotic Grasping: A Simulated Comparative Evaluation of Off-Policy Methods , 2018, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[16]  Tapomayukh Bhattacharjee,et al.  Adaptive Robot-Assisted Feeding: An Online Learning Framework for Acquiring Previously Unseen Food Items , 2019, 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[17]  Charles C. Kemp,et al.  A Multimodal Anomaly Detector for Robot-Assisted Feeding Using an LSTM-Based Variational Autoencoder , 2017, IEEE Robotics and Automation Letters.

[18]  Zhong Shao,et al.  Novelty Detection via Network Saliency in Visual-Based Deep Learning , 2019, 2019 49th Annual IEEE/IFIP International Conference on Dependable Systems and Networks Workshops (DSN-W).

[19]  Mustafa İlhan Gökler,et al.  Design of an automatic tool changer with disc magazine for a CNC horizontal machining center , 1997 .

[20]  Tapomayukh Bhattacharjee,et al.  Learning from failures in robot-assisted feeding: Using online learning to develop manipulation strategies for bite acquisition , 2019, ArXiv.

[21]  Jimmy A. Jørgensen,et al.  Applying a learning framework for improving success rates in industrial bin picking , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[22]  Kenneth Y. Goldberg,et al.  Learning Deep Policies for Robot Bin Picking by Simulating Robust Grasping Sequences , 2017, CoRL.

[23]  Luca Bertinetto,et al.  Fully-Convolutional Siamese Networks for Object Tracking , 2016, ECCV Workshops.

[24]  Sankaran Srinivas Error recovery in robot systems. , 1977 .

[25]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[26]  Karol Hausman,et al.  Quantile QT-Opt for Risk-Aware Vision-Based Robotic Grasping , 2019, Robotics: Science and Systems.

[27]  Ken Goldberg,et al.  Learning ambidextrous robot grasping policies , 2019, Science Robotics.

[28]  Andras Toth,et al.  Experimental evaluation of a novel automatic service robot tool changer , 2011, 2011 IEEE/ASME International Conference on Advanced Intelligent Mechatronics (AIM).

[29]  Jeremy A. Marvel,et al.  Addressing perception uncertainty induced failure modes in robotic bin-picking , 2016 .

[30]  Chun Yuan,et al.  A Reinforcement Learning Approach to Automatic Error Recovery , 2007, 37th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN'07).

[31]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[32]  Hokeun Kim,et al.  A multimodal execution monitor with anomaly classification for robot-assisted feeding , 2017, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[33]  Sergey Levine,et al.  Learning hand-eye coordination for robotic grasping with deep learning and large-scale data collection , 2016, Int. J. Robotics Res..

[34]  Jürgen Leitner,et al.  EGAD! An Evolved Grasping Analysis Dataset for Diversity and Reproducibility in Robotic Manipulation , 2020, IEEE Robotics and Automation Letters.

[35]  Nigel W. Hardy,et al.  Knowledge Based Error Recovery in Industrial Robots , 1983, IJCAI.