Fast camera motion estimation for hand-held devices and applications

In this paper we present an efficient motion estimation algorithm for camera-enabled handheld devices, such as mobile phones and PDAs. Compared to general camera motion estimation, the estimation of ego-motion of handheld devices presents unique challenges because most devices are limited in resources (processing power, memory and battery). The algorithm must be lightweight so it can be efficiently embedded and run fast enough to produce smooth seamless translation. Our solution includes a multi-resolution scheme which searches the match from coarse to fine; and optimization of the search space. As a demonstration, we implemented the algorithm on the Symbian based Nokia 3650/6600 camera phone, and explored several interesting applications, including using camera motion for browsing documents and as a pointing device. A cross-platform implementation is also discussed.

[1]  Simon Lacroix,et al.  Position estimation in outdoor environments using pixel tracking and stereovision , 2000, Proceedings 2000 ICRA. Millennium Conference. IEEE International Conference on Robotics and Automation. Symposia Proceedings (Cat. No.00CH37065).

[2]  Brad A. Myers The pebbles project: using PCs and hand-held computers together , 2000, CHI Extended Abstracts.

[3]  Michael Brady,et al.  Practical Structure and Motion from Stereo When Motion is Unconstrained , 2000, International Journal of Computer Vision.

[4]  Carlo Tomasi,et al.  Comparison of approaches to egomotion computation , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[5]  S. Shankar Sastry,et al.  Omnidirectional Egomotion Estimation From Back-projection Flow , 2003, 2003 Conference on Computer Vision and Pattern Recognition Workshop.

[6]  Janne Heikkilä,et al.  A Vision-Based Approach for Controlling User Interfaces of Mobile Devices , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Workshops.

[7]  Ronen Basri,et al.  Fast multiscale image segmentation , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[8]  David S. Doermann,et al.  Building mosaics from video using MPEG motion vectors , 1999, MULTIMEDIA '99.

[9]  Andrew W. Fitzgibbon,et al.  Simultaneous linear estimation of multiple view geometry and lens distortion , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[10]  Edward H. Adelson,et al.  The Laplacian Pyramid as a Compact Image Code , 1983, IEEE Trans. Commun..

[11]  Takeo Kanade,et al.  Transforming camera geometry to a virtual downward-looking camera: robust ego-motion estimation and ground-layer detection , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[12]  Rolf-Rainer Grigat,et al.  Real-time hybrid pose estimation from vision and inertial data , 2004, First Canadian Conference on Computer and Robot Vision, 2004. Proceedings..

[13]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.