A trifocal tensor based camera-projector system for robot-human interaction

In this paper we present a novel real time camera-projector system for assisting robot-human communication using natural interfaces. The system is comprised of a stereo camera pair and a DLP projector. The proposed system provides feedback information indicating the robot's understanding of the environment and what action a human user desires. Feedback is delivered by iteratively spotlighting objects in the environment using a projector. When the spotlight lands on an object that the human user wants the robot to retrieve, he or she can confirm the object selection, and the robot will move to a position suitable for grasping the object. In this investigation, the proposed camera-projector performs three tasks: 1) Locate salient objects in the scene using a visual attention model and a trifocal tensor based object matching method. 2) Project a spot of light on detected objects to provide information back to the human user. 3) Reconstruct the 3D position of the target to plan robot motion and conduct vision-based control to move a robot manipulator to the object. A calibration procedure for three view camera-projector system based on the trifocal tensor is presented. Experimental results of human-directed visual servoing task are presented to demonstrate the functions of the proposed system.

[1]  Seth Hutchinson,et al.  Visual Servo Control Part I: Basic Approaches , 2006 .

[2]  Heng-Tze Cheng,et al.  Imirok: Real-time imitative robotic arm control for home robot applications , 2011, 2011 IEEE International Conference on Pervasive Computing and Communications Workshops (PERCOM Workshops).

[3]  Andrew Zisserman,et al.  Multiple View Geometry , 1999 .

[4]  Ho-Sub Yoon,et al.  A Deconvolutive Neural Network for Speech Classification With Applications to Home Service Robot , 2010, IEEE Transactions on Instrumentation and Measurement.

[5]  Federico Tombari,et al.  Evaluation of stereo algorithms for 3D object recognition , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[6]  Carlos Balaguer,et al.  Live experimentation of the service robot applications for elderly people care in home environments , 2005, 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[7]  J. W. Minett,et al.  Optimizing the P300-based brain–computer interface: current status, limitations and future directions , 2011, Journal of neural engineering.

[8]  Janusz Konrad,et al.  A gesture-driven computer interface using Kinect , 2012, 2012 IEEE Southwest Symposium on Image Analysis and Interpretation.

[9]  C.-F. Chen,et al.  Adaptive 3D Reconstruction system with improved recovery of miscoded region to automatically adjust projected light patterns , 2008, 2008 23rd International Conference Image and Vision Computing New Zealand.

[10]  Veronica Teichrieb,et al.  Poster: Improving motor rehabilitation process through a natural interaction based system using Kinect sensor , 2012, 2012 IEEE Symposium on 3D User Interfaces (3DUI).

[11]  Mark R. Pickering,et al.  Accurate depth estimation using structured light and passive stereo disparity estimation , 2011, 2011 18th IEEE International Conference on Image Processing.

[12]  N. Hurtós,et al.  Plane-based calibration of a projector-camera system , 2008 .

[13]  Gen Endo,et al.  Study on a practical robotic follower to support home oxygen therapy patients-development and control of a mobile platform , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[14]  James J. Little,et al.  Informed visual search: Combining attention and object recognition , 2008, 2008 IEEE International Conference on Robotics and Automation.

[15]  Richard M. Murray,et al.  A Mathematical Introduction to Robotic Manipulation , 1994 .

[16]  Ashutosh Saxena,et al.  Robotic Grasping of Novel Objects using Vision , 2008, Int. J. Robotics Res..

[17]  Danica Kragic,et al.  Enhanced visual scene understanding through human-robot dialog , 2010, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[18]  Roozbeh Jafari,et al.  Spectral Spatio-Temporal template extraction from EEG signals , 2010, 2010 Annual International Conference of the IEEE Engineering in Medicine and Biology.

[19]  Danica Kragic,et al.  Attention-based active 3D point cloud segmentation , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[20]  François Chaumette,et al.  Visual servo control. I. Basic approaches , 2006, IEEE Robotics & Automation Magazine.

[21]  Danica Kragic,et al.  Combination of foveal and peripheral vision for object recognition and pose estimation , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[22]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[23]  Ming-Kuei Hu,et al.  Visual pattern recognition by moment invariants , 1962, IRE Trans. Inf. Theory.