Depth estimation in still images and videos using a motionless monocular camera

In this research we address the problem of depth estimation using a single motionless monocular camera. In our method we make no use of reference objects or marks in the image plane or on the ground apart from a one-off object used for horizon line detection; even this, however, is not necessary if a vanishing point detection algorithm is employed. Camera height is the only known parameter that is projected onto the image plane. Our algorithm has been tested using both a light calibrated and a non-calibrated camera and the results presented demonstrate that it works exceptionally well with both options. Our method promises to relax several assumptions and restrictions followed by state-of-the-art methods such as the height or width of the object of interest. Furthermore, our algorithm has been tested on still images as well as on videos using a background subtraction algorithm for automatic segmentation of foreground moving objects. The results obtained demonstrate our method is accurate and useful to a variety of applications from robot navigation to target tracking.

[1]  W. Eric L. Grimson,et al.  Adaptive background mixture models for real-time tracking , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[2]  Ian D. Reid,et al.  Single View Metrology , 2000, International Journal of Computer Vision.

[3]  P. KaewTrakulPong,et al.  An Improved Adaptive Background Mixture Model for Real-time Tracking with Shadow Detection , 2002 .

[4]  Kenneth Y. Goldberg,et al.  Visual tracking of human visitors under variable-lighting conditions for a responsive audio art installation , 2012, 2012 American Control Conference (ACC).

[5]  Ashutosh Saxena,et al.  Learning Depth from Single Monocular Images , 2005, NIPS.

[6]  Jean-Marc Odobez,et al.  Multi-Layer Background Subtraction Based on Color and Texture , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[7]  Carlos Hernández,et al.  Video-based, real-time multi-view stereo , 2011, Image Vis. Comput..

[8]  Richard Szeliski,et al.  A Taxonomy and Evaluation of Dense Two-Frame Stereo Correspondence Algorithms , 2001, International Journal of Computer Vision.

[9]  Moongu Jeon,et al.  A New Framework for Background Subtraction Using Multiple Cues , 2012, ACCV.

[10]  Marc Van Droogenbroeck,et al.  ViBe: A Universal Background Subtraction Algorithm for Video Sequences , 2011, IEEE Transactions on Image Processing.

[11]  Aristodemos Pnevmatikakis,et al.  3D Tracking and Classification System Using a Monocular Camera , 2017, Wirel. Pers. Commun..

[12]  Ian D. Reid,et al.  Learning Depth from Single Monocular Images Using Deep Convolutional Neural Fields , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Zoran Zivkovic,et al.  Improved adaptive Gaussian mixture model for background subtraction , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[14]  Davide Scaramuzza,et al.  REMODE: Probabilistic, monocular dense reconstruction in real time , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[15]  Sotirios Ch. Diamantas Biological and metric maps applied to robot homing , 2010 .

[16]  Zezhi Chen,et al.  A method of visual metrology from uncalibrated images , 2006, Pattern Recognit. Lett..

[17]  I. Haritaoglu,et al.  Background and foreground modeling using nonparametric kernel density estimation for visual surveillance , 2002 .

[18]  Antonios Gasteratos,et al.  Stereovision-Based Algorithm for Obstacle Avoidance , 2009, ICIRA.

[19]  Lucia Maddalena,et al.  A fuzzy spatial coherence-based approach to background/foreground separation for moving object detection , 2010, Neural Computing and Applications.

[20]  Marko Heikkilä,et al.  A texture-based method for modeling the background and detecting moving objects , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Lucia Maddalena,et al.  A Self-Organizing Approach to Background Subtraction for Visual Surveillance Applications , 2008, IEEE Transactions on Image Processing.

[22]  Richard M. Crowder,et al.  Depth estimation for autonomous robot navigation: A comparative approach , 2010, 2010 IEEE International Conference on Imaging Systems and Techniques.

[23]  Richard M. Crowder,et al.  Depth computation using optical flow and least squares , 2010, 2010 IEEE/SICE International Symposium on System Integration.