Syddansk Universitet Temporal accumulation of oriented visual features

In this paper we present a framework for accumulating on-line a model of a moving object (e.g., when manipulated by a robot). The proposed scheme is based on Bayesian filtering of local features, filtering jointly position, orientation and appearance information. The work presented here is novel in two aspects: first, we use an estimation mechanism that updates iteratively not only geometrical information, but also appearance information. Second, we propose a probabilistic version of the classical n-scan criterion that allows us to select which features are preserved and which are discarded, while making use of the available uncertainty model. The accumulated representations have been used in three different contexts: pose estimation, robotic grasping, and driver assistance scenario. 2010 Elsevier Inc. All rights reserved.

[1]  Florentin Wörgötter,et al.  Reconstruction uncertainty and 3D relations , 2008 .

[2]  D. Reid An algorithm for tracking multiple targets , 1978, 1978 IEEE Conference on Decision and Control including the 17th Symposium on Adaptive Processes.

[3]  O. Faugeras Three-Dimensional Computer Vision , 1993 .

[4]  A. T. Yang,et al.  Application of Dual-Number Quaternion Algebra to the Analysis of Spatial Mechanisms , 1964 .

[5]  Andrew W. Fitzgibbon,et al.  Bundle Adjustment - A Modern Synthesis , 1999, Workshop on Vision Algorithms.

[6]  Hugh F. Durrant-Whyte,et al.  Simultaneous Localization and Mapping with Sparse Extended Information Filters , 2004, Int. J. Robotics Res..

[7]  J. Y. S. Luh,et al.  Dual-number transformation and its applications to robotics , 1987, IEEE Journal on Robotics and Automation.

[8]  Bernd Jähne,et al.  BOOK REVIEW: Digital Image Processing, 5th revised and extended edition , 2002 .

[9]  Hai Tao,et al.  Object Tracking with Bayesian Estimation of Dynamic Layer Representations , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Bernhard P. Wrobel,et al.  Multiple View Geometry in Computer Vision , 2001 .

[11]  Frank Dellaert,et al.  MCMC-based particle filtering for tracking a variable number of interacting targets , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Andrew J. Davison,et al.  Active search for real-time vision , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[13]  Peter Kovesi,et al.  Image Features from Phase Congruency , 1995 .

[14]  David Nistér,et al.  Preemptive RANSAC for live structure and motion estimation , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[15]  Frank Dellaert,et al.  MCMC Data Association and Sparse Factorization Updating for Real Time Multitarget Tracking with Merged and Multiple Measurements , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Eduardo Mario Nebot,et al.  Optimization of the simultaneous localization and map-building algorithm for real-time implementation , 2001, IEEE Trans. Robotics Autom..

[17]  Nando de Freitas,et al.  The Unscented Particle Filter , 2000, NIPS.

[18]  Neil J. Gordon,et al.  A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking , 2002, IEEE Trans. Signal Process..

[19]  Norbert Krüger,et al.  Spatial-Temporal Junction Extraction and Semantic Interpretation , 2009, ISVC.

[20]  Ken Shoemake,et al.  Animating rotation with quaternion curves , 1985, SIGGRAPH.

[21]  Nicolas Pugeault,et al.  Early cognitive vision: feedback mechanisms for the disambiguation of early visual representation , 2008 .

[22]  Michel Dhome,et al.  3D reconstruction of complex structures with bundle adjustment: an incremental approach , 2006, Proceedings 2006 IEEE International Conference on Robotics and Automation, 2006. ICRA 2006..

[23]  Simon Lacroix,et al.  Vision-Based SLAM: Stereo and Monocular Approaches , 2007, International Journal of Computer Vision.

[24]  Justus H. Piater,et al.  A Probabilistic Framework for 3D Visual Object Representation , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25]  Fergal Shevlin,et al.  Analysis of orientation problems using Plucker lines , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[26]  Ramakant Nevatia,et al.  Segmentation and Tracking of Multiple Humans in Crowded Environments , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[27]  Norbert Krüger,et al.  A Three-level Architecture for Model-free Detection and Tracking of Independently Moving Objects , 2010, VISAPP.

[28]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[29]  Florentin Wörgötter,et al.  Rigid Body Motion in an Early Cognitive Vision Framework , 2006 .

[30]  Florentin Wörgötter,et al.  International Journal of Humanoid Robotics c ○ World Scientific Publishing Company Visual Primitives: Local, Condensed, Semantically Rich Visual Descriptors and their Applications in Robotics , 2022 .

[31]  Norbert Krüger,et al.  Multi-Modal Matching Applied to Stereo , 2003, BMVC.

[32]  T. Başar,et al.  A New Approach to Linear Filtering and Prediction Problems , 2001 .

[33]  Markus Lappe,et al.  Biologically Motivated Multi-modal Processing of Visual Primitives , 2003 .

[34]  Norbert Krüger,et al.  Comparison of Point and Line Features and Their Combination for Rigid Body Motion Estimation , 2009, Statistical and Geometrical Approaches to Visual Motion Analysis.