View Selection with Geometric Uncertainty Modeling

Estimating positions of world points from features observed in images is a key problem in 3D reconstruction, image mosaicking,simultaneous localization and mapping and structure from motion. We consider a special instance in which there is a dominant ground plane $\mathcal{G}$ viewed from a parallel viewing plane $\mathcal{S}$ above it. Such instances commonly arise, for example, in aerial photography. Consider a world point $g \in \mathcal{G}$ and its worst case reconstruction uncertainty $\varepsilon(g,\mathcal{S})$ obtained by merging \emph{all} possible views of $g$ chosen from $\mathcal{S}$. We first show that one can pick two views $s_p$ and $s_q$ such that the uncertainty $\varepsilon(g,\{s_p,s_q\})$ obtained using only these two views is almost as good as (i.e. within a small constant factor of) $\varepsilon(g,\mathcal{S})$. Next, we extend the result to the entire ground plane $\mathcal{G}$ and show that one can pick a small subset of $\mathcal{S'} \subseteq \mathcal{S}$ (which grows only linearly with the area of $\mathcal{G}$) and still obtain a constant factor approximation, for every point $g \in \mathcal{G}$, to the minimum worst case estimate obtained by merging all views in $\mathcal{S}$. Finally, we present a multi-resolution view selection method which extends our techniques to non-planar scenes. We show that the method can produce rich and accurate dense reconstructions with a small number of views. Our results provide a view selection mechanism with provable performance guarantees which can drastically increase the speed of scene reconstruction algorithms. In addition to theoretical results, we demonstrate their effectiveness in an application where aerial imagery is used for monitoring farms and orchards.

[1]  Loong Fah Cheong,et al.  Effects of Errors in the Viewing Geometry on Shape Estimation , 1998, Comput. Vis. Image Underst..

[2]  Richard Szeliski,et al.  Towards Internet-scale multi-view stereo , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[3]  Volkan Isler,et al.  Adaptive View Planning for Aerial 3D Reconstruction of Complex Scenes , 2018, ArXiv.

[4]  G. Klein,et al.  Parallel Tracking and Mapping for Small AR Workspaces , 2007, 2007 6th IEEE and ACM International Symposium on Mixed and Augmented Reality.

[5]  Daniel Nikovski,et al.  Submodular Function Maximization for Group Elevator Scheduling , 2017, ICAPS.

[6]  Andrew J. Davison,et al.  Real-time simultaneous localisation and mapping with a single camera , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[7]  Volkan Isler,et al.  Large Scale Image Mosaic Construction for Agricultural Applications , 2016, IEEE Robotics and Automation Letters.

[8]  Richard Szeliski,et al.  Skeletal graphs for efficient structure from motion , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[9]  Luc Van Gool,et al.  An Integer Linear Programming Model for View Selection on Overlapping Camera Clusters , 2014, 2014 2nd International Conference on 3D Vision.

[10]  G. Roth,et al.  View planning for automated three-dimensional object reconstruction and inspection , 2003, CSUR.

[11]  Peter F. Sturm,et al.  A Factorization Based Algorithm for Multi-Image Projective Structure and Motion , 1996, ECCV.

[12]  Horst Bischof,et al.  Photogrammetric Camera Network Design for Micro Aerial Vehicles , 2012 .

[13]  Volkan Isler,et al.  Sensor Selection in Arbitrary Dimensions , 2008, IEEE Transactions on Automation Science and Engineering.

[14]  Kiriakos N. Kutulakos,et al.  Recovering shape by purposive viewpoint adjustment , 1992, International Journal of Computer Vision.

[15]  Qi Tian,et al.  Less is More: Efficient 3-D Object Retrieval With Query View Selection , 2011, IEEE Transactions on Multimedia.

[16]  Daniel Cremers,et al.  LSD-SLAM: Large-Scale Direct Monocular SLAM , 2014, ECCV.

[17]  Robert Kaucic,et al.  Plane-based projective reconstruction , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[18]  Sang Wook Lee,et al.  View Selection Strategies for Multi-View, Wide-Baseline Stereo , 1994 .

[19]  Anup Basu,et al.  Analysis of Error in Depth Perception with Vergence and Spatially Varying Sensing , 1996, Comput. Vis. Image Underst..

[20]  Juan D. Tardós,et al.  ORB-SLAM2: An Open-Source SLAM System for Monocular, Stereo, and RGB-D Cameras , 2016, IEEE Transactions on Robotics.

[21]  Volkan Isler,et al.  Gathering Bearing Data for Target Localization , 2016, IEEE Robotics and Automation Letters.

[22]  Leif Kobbelt,et al.  Image selection for improved Multi-View Stereo , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[23]  Ruzena Bajcsy,et al.  Occlusions as a Guide for Planning the Next View , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[24]  Andreas Krause,et al.  Submodular Function Maximization , 2014, Tractability.

[25]  Mateu Sbert,et al.  Automatic View Selection Using Viewpoint Entropy and its Application to Image‐Based Modelling , 2003, Comput. Graph. Forum.