A combined vision-inertial fusion approach for 6-DoF object pose estimation

The estimation of the 3D position and orientation of moving objects (‘pose’ estimation) is a critical process for many applications in robotics, computer vision or mobile services. Although major research efforts have been carried out to design accurate, fast and robust indoor pose estimation systems, it remains as an open challenge to provide a low-cost, easy to deploy and reliable solution. Addressing this issue, this paper describes a hybrid approach for 6 degrees of freedom (6-DoF) pose estimation that fuses acceleration data and stereo vision to overcome the respective weaknesses of single technology approaches. The system relies on COTS technologies (standard webcams, accelerometers) and printable colored markers. It uses a set of infrastructure cameras, located to have the object to be tracked visible most of the operation time; the target object has to include an embedded accelerometer and be tagged with a fiducial marker. This simple marker has been designed for easy detection and segmentation and it may be adapted to different service scenarios (in shape and colors). Experimental results show that the proposed system provides high accuracy, while satisfactorily dealing with the real-time constraints.

[1]  Farid Golnaraghi,et al.  A Kalman/Particle Filter-Based Position and Orientation Estimation Method Using a Position Sensor/Inertial Measurement Unit Hybrid System , 2010, IEEE Transactions on Industrial Electronics.

[2]  D. W. F. van Krevelen,et al.  A Survey of Augmented Reality Technologies, Applications and Limitations , 2010, Int. J. Virtual Real..

[3]  Lars Asplund,et al.  An embedded stereo vision module for 6D pose estimation and mapping , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[4]  Mohan M. Trivedi,et al.  3-D Posture and Gesture Recognition for Interactivity in Smart Spaces , 2012, IEEE Transactions on Industrial Informatics.

[5]  Zhengyou Zhang,et al.  Flexible camera calibration by viewing a plane from unknown orientations , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[6]  Minh-Triet Tran,et al.  Overlay upper Clothing Textures to Still Images based on Human Pose Estimation , 2014, J. Mobile Multimedia.

[7]  Heiko Hirschmüller,et al.  Stereo vision and IMU based real-time ego-motion and depth image computation on a handheld device , 2013, 2013 IEEE International Conference on Robotics and Automation.

[8]  Jihad El-Sana,et al.  Shape recognition and pose estimation for mobile augmented reality , 2009, ISMAR.

[9]  Roland Siegwart,et al.  Fusion of IMU and Vision for Absolute Scale Estimation in Monocular SLAM , 2011, J. Intell. Robotic Syst..

[10]  Andrew W. Fitzgibbon,et al.  Real-time human pose recognition in parts from single depth images , 2011, CVPR 2011.

[11]  Ana M. Bernardos,et al.  A System to Enable Level-of-Detail Mobile Interaction with Augmented Media Objects , 2014, 2014 Eighth International Conference on Innovative Mobile and Internet Services in Ubiquitous Computing.

[12]  Miguel A. Olivares-Méndez,et al.  On-board and Ground Visual Pose Estimation Techniques for UAV Control , 2011, J. Intell. Robotic Syst..

[13]  Mohan M. Trivedi,et al.  Head Pose Estimation in Computer Vision: A Survey , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Xiaoming Hu,et al.  Autocalibration of an electronic compass for augmented reality , 2005, Fourth IEEE and ACM International Symposium on Mixed and Augmented Reality (ISMAR'05).

[15]  Wookho Son,et al.  Efficient mobile museum guidance system using augmented reality , 2008, 2008 IEEE International Symposium on Consumer Electronics.

[16]  Mircea Nicolescu,et al.  Vision-based hand pose estimation: A review , 2007, Comput. Vis. Image Underst..

[17]  Andreas Riener,et al.  Head-Pose-Based Attention Recognition on Large Public Displays , 2014, IEEE Computer Graphics and Applications.

[18]  Nico Blodow,et al.  CAD-model recognition and 6DOF pose estimation using 3D cues , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[19]  Vitor Santos,et al.  Object recognition and pose estimation for industrial applications: A cascade system , 2014 .

[20]  Bernhard P. Wrobel,et al.  Multiple View Geometry in Computer Vision , 2001 .

[21]  Georgios S. Paschos,et al.  Perceptually uniform color spaces for color texture analysis: an empirical evaluation , 2001, IEEE Trans. Image Process..

[22]  Robert Schmitt,et al.  Estimation of the absolute camera pose for environment recognition of industrial robotics , 2013, Prod. Eng..