Dynamic 3D scene analysis for acquiring articulated scene models

In this paper we present a new system for a mobile robot to generate an articulated scene model by analyzing complex dynamic 3D scenes. The system extracts essential knowledge about the foreground, like moving persons, and the background, which consists of all visible static scene parts. In contrast to other 3D reconstruction approaches, we suggest to additionally distinguish between static parts, like walls, and movable objects like chairs or doors. The discrimination supports the reconstruction process and additionally, delivers important information about interaction objects. Here, the movable object detection is realized object independent by analyzing changes in the scenery. Furthermore, in the proposed system the background scene is feedbacked to the tracking part yielding a much better tracking and detection result which improves again the 3D reconstruction. We show in our experiments that we are able to provide a sound background model and to extract simultaneously persons and object regions representing chairs, doors, and even smaller movable objects.

[1]  Uwe Franke,et al.  6D-Vision: Fusion of Stereo and Motion for Robust Environment Perception , 2005, DAGM-Symposium.

[2]  Takeo Kanade,et al.  An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.

[3]  Agnes Swadzba,et al.  Tracking objects in 6D for reconstructing static scenes , 2008, 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[4]  Roland Siegwart,et al.  Cognitive maps for mobile robots - an object based approach , 2007, Robotics Auton. Syst..

[5]  S. P. Mudur,et al.  Three-dimensional computer vision: a geometric viewpoint , 1993 .

[6]  Michael R. Berthold,et al.  Intelligent Data Analysis , 2000, Springer Berlin Heidelberg.

[7]  Joachim Hertzberg,et al.  Towards semantic maps for mobile robots , 2008, Robotics Auton. Syst..

[8]  Ingemar J. Cox,et al.  A Maximum Likelihood Stereo Algorithm , 1996, Comput. Vis. Image Underst..

[9]  James W. Davis,et al.  Tracking mean shift clustered point clouds for 3D surveillance , 2006, VSSN '06.

[10]  Stefan Posch,et al.  Detection and Tracking of Moving Objects for Mosaic Image Generation , 2001, DAGM-Symposium.

[11]  Hugh F. Durrant-Whyte,et al.  Simultaneous Localization, Mapping and Moving Object Tracking , 2007, Int. J. Robotics Res..

[12]  John A. Bateman,et al.  Dialog-Based 3D-Image Recognition Using a Domain Ontology , 2006, Spatial Cognition.

[13]  Wolfram Burgard,et al.  Learning Kinematic Models for Articulated Objects , 2009, IJCAI.

[14]  Szymon Rusinkiewicz,et al.  Spacetime stereo: a unifying framework for depth from triangulation , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Ingo Lütkebohle,et al.  The curious robot - Structuring interactive robot learning , 2009, 2009 IEEE International Conference on Robotics and Automation.

[16]  Christian Wöhler,et al.  3D Scene Segmentation and Object Tracking in Multiocular Image Sequences , 2007, ICVS 2007.

[17]  Jan-Olof Eklundh,et al.  Statistical background subtraction for a mobile observer , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[18]  William Whittaker,et al.  Conditional particle filters for simultaneous mobile robot localization and people-tracking , 2002, Proceedings 2002 IEEE International Conference on Robotics and Automation (Cat. No.02CH37292).

[19]  James W. Davis,et al.  Integrating Appearance and Motion Cues for Simultaneous Detection and Segmentation of Pedestrians , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[20]  Roland Siegwart,et al.  A state-of-the-art 3D sensor for robot navigation , 2004, 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (IEEE Cat. No.04CH37566).