Unified framework for automatic image stitching and rectification

Abstract. Conventional image stitching methods were developed under the assumption or condition that (1) the optical center of a camera is fixed (fixed-optical-center case) or (2) the camera captures a plane target (plane-target case). Hence, users should know or test which condition is more appropriate for the given set of images and then select a right algorithm or try multiple stitching algorithms. We propose a unified framework for the image stitching and rectification problem, which can handle both cases in the same framework. To be precise, we model each camera pose with six parameters (three for the rotation and three for the translation) and develop a cost function that reflects the registration errors on a reference plane. The designed cost function is effectively minimized via the Levenberg–Marquardt algorithm. For the given set of images, when it is found that the relative camera motions between the images are large, the proposed method performs rectification of images and then composition using the rectified images; otherwise, the algorithm simply builds a visually pleasing result by selecting a viewpoint. Experimental results on synthetic and real images show that our method successfully performs stitching and metric rectification.

[1]  Steven M. Seitz,et al.  Dynamic Mosaics , 2012, 2012 Second International Conference on 3D Imaging, Modeling, Processing, Visualization & Transmission.

[2]  Jorge J. Moré,et al.  The Levenberg-Marquardt algo-rithm: Implementation and theory , 1977 .

[3]  Dieter Schmalstieg,et al.  Real-time panoramic mapping and tracking on mobile phones , 2010, 2010 IEEE Virtual Reality Conference (VR).

[4]  Shengping Zhang,et al.  Dynamic image mosaic via SIFT and dynamic programming , 2013, Machine Vision and Applications.

[5]  Vincent Lepetit,et al.  Video-Based In Situ Tagging on Mobile Phones , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[6]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[7]  David S. Doermann,et al.  Camera-Based Document Image Mosaicing , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[8]  Gregory Dudek,et al.  Image stitching with dynamic elements , 2009, Image Vis. Comput..

[9]  Changhai Xu,et al.  3D pose estimation for planes , 2009, 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops.

[10]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[11]  Matthew A. Brown,et al.  Automatic Panoramic Image Stitching using Invariant Features , 2007, International Journal of Computer Vision.

[12]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[13]  Maurizio Pilu,et al.  Extraction of illusory linear clues in perspectively skewed documents , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[14]  Edward H. Adelson,et al.  The Laplacian Pyramid as a Compact Image Code , 1983, IEEE Trans. Commun..

[15]  Andrew J. Davison,et al.  Real-Time Spherical Mosaicing Using Whole Image Alignment , 2010, ECCV.

[16]  Sohaib Khan,et al.  Shape from Angle Regularity , 2012, ECCV.

[17]  Majid Mirmehdi,et al.  Estimating the Orientation and Recovery of Text Planes in a Single Image , 2001, BMVC.

[18]  Pedro E. López-de-Teruel,et al.  Practical Planar Metric Rectification , 2006, BMVC.

[19]  Ezzeddine Zagrouba,et al.  An efficient image-mosaicing method based on multifeature matching , 2009, Machine Vision and Applications.

[20]  Nam Ik Cho,et al.  A new method to find an optimal warping function in image stitching , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[21]  Xu Liu,et al.  Document Image Mosaicing with Mobile Phones , 2007, 14th International Conference on Image Analysis and Processing (ICIAP 2007).

[22]  S. Umeyama,et al.  Least-Squares Estimation of Transformation Parameters Between Two Point Patterns , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[23]  Dana Cobzas,et al.  3D SSD tracking with estimated 3D planes , 2005, The 2nd Canadian Conference on Computer and Robot Vision (CRV'05).

[24]  José Miguel Buenaposada,et al.  Real-time tracking and estimation of plane pose , 2002, Object recognition supported by user interaction for service robots.

[25]  Bernhard P. Wrobel,et al.  Multiple View Geometry in Computer Vision , 2001 .