Bin Picking Using Manifold Learning

Bin picking using vision based sensors requires accurate estimation of location and pose of the object for positioning the end effector of the robotic arm. The computational burden and complexity depends upon the parametric model adopted for the task. Learning based techniques to implement the scheme using low dimensional manifolds offer computationally more efficient alternatives. In this paper we have employed Locally Linear Embedding (LLE) and Deep Learning (with auto encoders) for manifold learning in the visual domain as well as for the parameters of robotic manipulator for visual servoing. Images of clusters of cylindrical pellets were used as the training data set in the visual domain. Corresponding parameters of the six degrees of freedom robot for picking designated cylindrical pellet formed the training dataset in the robotic configuration space. The correspondence between the weight coefficients of LLE manifold in the visual domain and robotic domain is established through regression. Autoencoders in conjunction with feed forward neural networks were used for learning of correspondence between the high dimensional visual space and low dimensional configuration space. We have compared the results of the two implementations for the same dataset and found that manifold learning using auto encoders resulted in better performance. The eye-in-hand configuration used with KUKA KR5 robotic arm and Basler camera offers a potentially effective and efficient solution to the bin picking problem through learning based visual servoing.

[1]  Jeff L. Edwards An active, appearance-based approach to the pose estimation of complex objects , 1996, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems. IROS '96.

[2]  Martin Jägersand,et al.  Learning based visual servoing , 2004, 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (IEEE Cat. No.04CH37566).

[3]  David Vernon,et al.  Pose estimation for objects with planar surfaces using eigenimage and range data analysis , 2007, Machine Vision and Applications.

[4]  Robert B. Kelley,et al.  A Robot System Which Acquires Cylindrical Workpieces from Bins , 1982, IEEE Transactions on Systems, Man, and Cybernetics.

[5]  Pascal Vincent,et al.  Stacked Denoising Autoencoders: Learning Useful Representations in a Deep Network with a Local Denoising Criterion , 2010, J. Mach. Learn. Res..

[6]  Paul F. Whelan,et al.  A bin picking system based on depth from defocus , 2003, Machine Vision and Applications.

[7]  Hiroshi Murase,et al.  Visual learning and recognition of 3-d objects from appearance , 2005, International Journal of Computer Vision.

[8]  M. Hebert,et al.  The Representation, Recognition, and Locating of 3-D Objects , 1986 .

[9]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[10]  Yoshua Bengio,et al.  Joint Training of Deep Boltzmann Machines , 2012, ArXiv.

[11]  L. Saul,et al.  An Introduction to Locally Linear Embedding , 2001 .

[12]  A. Gasteratos,et al.  Pose manifolds for efficient visual servoing , 2012, 2012 IEEE International Conference on Imaging Systems and Techniques Proceedings.

[13]  Avinash C. Kak,et al.  Calculating the 3d-pose of rigid-objects using active appearance models , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.