Estimating Poses of World's Photos with Geographic Metadata

Users can explore the world by viewing place related photos on Google Maps. One possible way is to take the nearby photos for viewing. However, for a given geo-location, many photos with view directions not pointing to the desired regions are returned by that world map. To address this problem, prior know the poses in terms of position and view direction of photos is a feasible solution. We can let the system return only nearby photos with view direction pointing to the target place, to facilitate the exploration of the place for users. Photo's view direction can be easily obtained if the extrinsic parameters of its corresponding camera are well estimated. Unfortunately, directly employing conventional methods for that is unfeasible since photos fallen into a range of certain radius centered at a place are observed be largely diverse in both content and view. Int this paper, we present a novel method to estimate the view directions of world's photos well. Then further obtain the pose referenced on Google Maps using the geographic Metadata of photos. The key point of our method is first generating a set of subsets when facing a large number of photos nearby a place, then reconstructing the scenes expressed by those subsets using normalized 8-point algorithm. We embed a search based strategy with scene alignment to product those subsets. We evaluate our method by user study on an online application developed by us, and the results show the effectiveness of our method.

[1]  Andrew J. Davison,et al.  Active Matching , 2008, ECCV.

[2]  Tat-Seng Chua,et al.  NUS-WIDE: a real-world web image database from National University of Singapore , 2009, CIVR '09.

[3]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[4]  Mads Nielsen,et al.  Computer Vision — ECCV 2002 , 2002, Lecture Notes in Computer Science.

[5]  Antonio Torralba,et al.  SIFT Flow: Dense Correspondence across Different Scenes , 2008, ECCV.

[6]  Tat-Seng Chua,et al.  ViewFocus: explore places of interests on Google maps using photos with view direction filtering , 2009, MM '09.

[7]  Andrew Zisserman,et al.  Multi-view Matching for Unordered Image Sets, or "How Do I Organize My Holiday Snaps?" , 2002, ECCV.

[8]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[9]  D.M. Mount,et al.  An Efficient k-Means Clustering Algorithm: Analysis and Implementation , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Richard I. Hartley,et al.  In Defense of the Eight-Point Algorithm , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Olivier D. Faugeras,et al.  The fundamental matrix: Theory, algorithms, and stability analysis , 2004, International Journal of Computer Vision.

[12]  Onay Urfalioglu,et al.  Robust estimation of camera rotation,translation and focal length at high outlier rates , 2004, First Canadian Conference on Computer and Robot Vision, 2004. Proceedings..