Iterative Object Localization Algorithm Using Visual Images with a Reference Coordinate

We present a simplified algorithm for localizing an object using multiple visual images that are obtained from widely used digital imaging devices. We use a parallel projection model which supports both zooming and panning of the imaging devices. Our proposed algorithm is based on a virtual viewable plane for creating a relationship between an object position and a reference coordinate. The reference point is obtained from a rough estimate which may be obtained from the preestimation process. The algorithm minimizes localization error through the iterative process with relatively low-computational complexity. In addition, nonlinearity distortion of the digital image devices is compensated during the iterative process. Finally, the performances of several scenarios are evaluated and analyzed in both indoor and outdoor environments.

[1]  Mubarak Shah,et al.  Camera handoff: tracking in multiple uncalibrated stationary cameras , 2000, Proceedings Workshop on Human Motion.

[2]  John W. McDonough,et al.  A joint particle filter for audio-visual speaker tracking , 2005, ICMI '05.

[3]  Luc Van Gool,et al.  Color-Based Object Tracking in Multi-camera Environments , 2003, DAGM-Symposium.

[4]  Zhengyou Zhang,et al.  A Flexible New Technique for Camera Calibration , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  I. Kaminer,et al.  A three point algorithm for attitude and range determination using vision , 2000, Proceedings of the 2000 American Control Conference. ACC (IEEE Cat. No.00CH36334).

[6]  Pascal Fua,et al.  Robust People Tracking with Global Trajectory Optimization , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[7]  Frédéric Lerasle,et al.  Visual localization of a mobile robot in indoor environments using planar landmarks , 2000, Proceedings. 2000 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2000) (Cat. No.00CH37113).

[8]  Yoshiaki Shirai,et al.  Object tracking based on optical flow and depth , 1996, 1996 IEEE/SICE/RSJ International Conference on Multisensor Fusion and Integration for Intelligent Systems (Cat. No.96TH8242).

[9]  Kostas Daniilidis,et al.  Omnidirectional video , 2003, The Visual Computer.

[10]  Paul A. Beardsley,et al.  Affine Calibration of Mobile Vehicles , 1995 .

[11]  Vincent Lepetit,et al.  Monocular Model-Based 3D Tracking of Rigid Objects: A Survey , 2005, Found. Trends Comput. Graph. Vis..

[12]  Youngjin Choi,et al.  Simple visual self-localization for indoor mobile robots using single video camera , 2004, 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (IEEE Cat. No.04CH37566).

[13]  Darren B. Ward,et al.  Particle filtering algorithms for tracking an acoustic source in a reverberant environment , 2003, IEEE Trans. Speech Audio Process..

[14]  Roger Y. Tsai,et al.  Techniques for Calibration of the Scale Factor and Image Center for High Accuracy 3-D Machine Vision Metrology , 1988, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[16]  Luc Van Gool,et al.  Affine Reconstruction from Perspective Image Pairs with a Relative Object-Camera Translation in Between , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[17]  Rachid Deriche,et al.  A Robust Technique for Matching two Uncalibrated Images Through the Recovery of the Unknown Epipolar Geometry , 1995, Artif. Intell..

[18]  J J Koenderink,et al.  Affine structure from motion. , 1991, Journal of the Optical Society of America. A, Optics and image science.

[19]  Mei Han,et al.  A detection-based multiple object tracking method , 2004, 2004 International Conference on Image Processing, 2004. ICIP '04..

[20]  Paul A. Beardsley,et al.  Sequential Updating of Projective and Affine Structure from Motion , 1997, International Journal of Computer Vision.

[21]  Takeo Kanade,et al.  A Paraperspective Factorization Method for Shape and Motion Recovery , 1994, ECCV.

[22]  Stephen J. Maybank,et al.  On plane-based camera calibration: A general algorithm, singularities, applications , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[23]  Ramakant Nevatia,et al.  Camera calibration from video of a walking human , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  O. Faugeras Stratification of three-dimensional vision: projective, affine, and metric representations , 1995 .

[25]  Takeo Kanade,et al.  Shape and motion from image streams under orthography: a factorization method , 1992, International Journal of Computer Vision.

[26]  Mubarak Shah,et al.  Consistent Labeling of Tracked Objects in Multiple Cameras with Overlapping Fields of View , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[27]  Sascha Spors,et al.  A Multi-Sensor Object Localization System , 2001, VMV.

[28]  I. Reid,et al.  Metric calibration of a stereo rig , 1995, Proceedings IEEE Workshop on Representation of Visual Scenes (In Conjunction with ICCV'95).

[29]  S. Bougnoux,et al.  From projective to Euclidean space under any practical situation, a criticism of self-calibration , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[30]  Larry S. Davis,et al.  W4: Real-Time Surveillance of People and Their Activities , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[31]  Yoshiaki Shirai,et al.  Optical flow-based person tracking by multiple cameras , 2001, Conference Documentation International Conference on Multisensor Fusion and Integration for Intelligent Systems. MFI 2001 (Cat. No.01TH8590).

[32]  Gopal Sarma Pingali,et al.  Audio-visual tracking for natural interactivity , 1999, MULTIMEDIA '99.

[33]  Long Quan,et al.  Affine stereo calibration , 1995, CAIP.

[34]  M.D. Naish,et al.  Active-vision-based multisensor surveillance - an implementation , 2006, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[35]  Larry S. Davis,et al.  Joint Audio-Visual Tracking Using Particle Filters , 2002, EURASIP J. Adv. Signal Process..

[36]  O. D. Faugeras,et al.  Camera Self-Calibration: Theory and Experiments , 1992, ECCV.

[37]  Shree K. Nayar,et al.  Telecentric Optics for Computational Vision , 1996, ECCV.

[38]  Huang Lee,et al.  Collaborative node localization in surveillance networks using opportunistic target observations , 2006, VSSN '06.

[39]  Gang Qian,et al.  Robust Multi-Camera 3D People Tracking with Partial Occlusion Handling , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[40]  R. Lienhart,et al.  Calibration of visual sensors and actuators in distributed computing platforms , 2005, VSSN '05.

[41]  Sohaib Khan,et al.  Camera calibration and three-dimensional world reconstruction of stereo-vision using neural networks , 2001, Int. J. Syst. Sci..

[42]  Janne Heikkilä,et al.  A four-step camera calibration procedure with implicit image correction , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[43]  Paul Debevec,et al.  Modeling and Rendering Architecture from Photographs , 1996, SIGGRAPH 1996.

[44]  Roberto Cipolla,et al.  Camera Calibration from Vanishing Points in Image of Architectural Scenes , 1999, BMVC.

[45]  Luc Van Gool,et al.  A stratified approach to metric self-calibration , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.