Issues and Strategies for Robotic Object Manipulation in Domestic Settings

Many robotic tasks such as autonomous navigation,human-machine collaboration, object manipulationand grasping facilitate visual information. Some of themajor reasearch and system design issues in terms of visualsystems are robustness and flexibility.In this paper, we present a number of visual strategiesfor robotic object manipulation tasks in natural, domesticenvironments. Given a complex fetch-and-carry type oftasks, the issues related to the whole detect-approachgrasploop are considered. Our vision system integratesa number of algorithms using monocular and binocularcues to achieve robustness in realistic settings. The cuesare considered and used in connection to both foveal andperipheral vision to provide depth information, segmentthe object(s) of interest in the scene, object recognition,tracking and pose estimation. One important propertyof the system is that the step from object recognitionto pose estimation is completely automatic combiningboth appearance and geometric models. Rather thanconcentrating on the integration issues, our primary goalis to investigate the importance and effect of cameraconfiguration, their number and type, to the choice anddesign of the underlying visual algorithms. Experimentalevaluation is performed in a realistic indoor environmentwith occlusions, clutter, changing lighting and backgroundconditions.

[1]  C. Schmid,et al.  Indexing based on scale invariant interest points , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[2]  Jan-Olof Eklundh,et al.  A pure learning approach to background-invariant object recognition using pedagogical support vector learning , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[3]  Jan-Olof Eklundh,et al.  Real-Time Epipolar Geometry Estimation of Binocular Stereo Heads , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Selim Benhimane,et al.  Vision-based control with respect to planar and non-planar objects using a zooming camera , 2003 .

[5]  Mårten Björkman Real-Time Motion and Stereo Cues for Active Visual Observers , 2002 .

[6]  Danica Kragic,et al.  Real-time tracking meets online grasp planning , 2001, Proceedings 2001 ICRA. IEEE International Conference on Robotics and Automation (Cat. No.01CH37164).

[7]  Christopher G. Harris,et al.  A Combined Corner and Edge Detector , 1988, Alvey Vision Conference.

[8]  In-So Kweon,et al.  Robust model-based 3D object recognition by combining feature matching with tracking , 2003, 2003 IEEE International Conference on Robotics and Automation (Cat. No.03CH37422).

[9]  Roberto Cipolla,et al.  212D Visual Servoing with Respect to Planar Contours having Complex and Unknown Shapes , 2003, Int. J. Robotics Res..

[10]  Danica Kragic,et al.  Confluence of parameters in model based tracking , 2003, 2003 IEEE International Conference on Robotics and Automation (Cat. No.03CH37422).

[11]  Steve Vinoski,et al.  CORBA: integrating diverse applications within distributed heterogeneous environments , 1997, IEEE Commun. Mag..

[12]  H. C. Longuet-Higgins,et al.  The interpretation of a moving retinal image , 1980, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[13]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[14]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[15]  Dorin Comaniciu,et al.  Real-time tracking of non-rigid objects using mean shift , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[16]  Kurt Konolige,et al.  Small Vision Systems: Hardware and Implementation , 1998 .

[17]  Dana H. Ballard,et al.  Animate Vision , 1991, Artif. Intell..

[18]  H. C. Longuet-Higgins,et al.  A computer algorithm for reconstructing a scene from two projections , 1981, Nature.

[19]  Danica Kragic,et al.  A Framework for Visual Servoing , 2003, ICVS.

[20]  Vladimir Kolmogorov,et al.  Computing visual correspondence with occlusions using graph cuts , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[21]  B. Ripley,et al.  Robust Statistics , 2018, Wiley Series in Probability and Statistics.

[22]  Peter I. Corke,et al.  A tutorial on visual servo control , 1996, IEEE Trans. Robotics Autom..

[23]  H. Barlow Vision Science: Photons to Phenomenology by Stephen E. Palmer , 2000, Trends in Cognitive Sciences.

[24]  Danica Kragic,et al.  Object recognition and pose estimation for robotic manipulation using color cooccurrence histograms , 2003, Proceedings 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2003) (Cat. No.03CH37453).

[25]  Danica Kragic,et al.  Combination of foveal and peripheral vision for object recognition and pose estimation , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[26]  Danica Kragic,et al.  Weak models and cue integration for real-time tracking , 2002, Proceedings 2002 IEEE International Conference on Robotics and Automation (Cat. No.02CH37292).