Calibration-Free Augmented Reality

Camera calibration and the acquisition of Euclidean 3D measurements have so far been considered necessary requirements for overlaying three-dimensional graphical objects with live video. We describe a new approach to video-based augmented reality that avoids both requirements: it does not use any metric information about the calibration parameters of the camera or the 3D locations and dimensions of the environment's objects. The only requirement is the ability to track across frames at least four fiducial points that are specified by the user during system initialization and whose world coordinates are unknown. Our approach is based on the following observation: given a set of four or more noncoplanar 3D points, the projection of all points in the set can be computed as a linear combination of the projections of just four of the points. We exploit this observation in two ways: (1) by tracking regions and color fiducial points at frame rate, and (2) by representing virtual objects in a non-Euclidean, affine frame of reference that allows their projection to be computed as a linear combination of the projections of the fiducial points. Experimental results on two augmented reality systems, one monitor-based and one head-mounted, demonstrate that the approach is readily implementable, imposes minimal computational and hardware requirements, and generates accurate video overlays in real time even when the camera parameters vary dynamically.
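
The linear-combination observation can be illustrated with a minimal sketch. It assumes an affine camera model (a 2x3 linear map plus a 2D offset), under which the relation holds exactly; the abstract does not spell out the camera model, so this is an assumption. The names `project` and `combine`, the camera values, and the point coordinates are hypothetical and chosen only for illustration. The sketch expresses a virtual point by its affine coordinates (x, y, z) relative to four noncoplanar fiducials, then checks that its image equals the same weighted combination of the fiducials' images, without using the camera parameters in the overlay step.

```python
def project(point, cam_rows, cam_offset):
    """Hypothetical affine camera: image = A * point + t, with A given as two 3-vectors."""
    return tuple(
        sum(row[i] * point[i] for i in range(3)) + off
        for row, off in zip(cam_rows, cam_offset)
    )


def combine(basis, xyz):
    """Combine four basis points (3D or 2D) using affine coordinates (x, y, z).

    The weights (1 - x - y - z, x, y, z) sum to one, so the result is
    b0 + x*(b1 - b0) + y*(b2 - b0) + z*(b3 - b0).
    """
    x, y, z = xyz
    weights = (1.0 - x - y - z, x, y, z)
    dim = len(basis[0])
    return tuple(sum(w * b[k] for w, b in zip(weights, basis)) for k in range(dim))


# Four noncoplanar fiducials. Their world coordinates appear here only to
# simulate the scene; the overlay step below never reads them.
fiducials_3d = [(0.0, 0.0, 0.0), (2.0, 0.0, 1.0), (0.0, 3.0, 1.0), (1.0, 1.0, 4.0)]

# Affine coordinates of one virtual-object vertex relative to the fiducials.
vertex_affine = (0.5, -0.25, 2.0)

# An arbitrary (unknown to the overlay step) affine camera.
cam_rows = ((0.9, 0.1, -0.3), (0.2, 1.1, 0.4))
cam_offset = (320.0, 240.0)

# Ground truth: build the vertex in 3D and project it with the camera.
direct = project(combine(fiducials_3d, vertex_affine), cam_rows, cam_offset)

# Calibration-free route: only the tracked 2D fiducial positions are used.
tracked_2d = [project(p, cam_rows, cam_offset) for p in fiducials_3d]
overlay = combine(tracked_2d, vertex_affine)

print(direct, overlay)
```

Because the weights sum to one, the camera's offset term passes through the combination unchanged, which is why the two routes agree for any affine camera. Under full perspective projection the relation is only approximate.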
