Compressed Voxel-Based Mapping Using Unsupervised Learning

In order to deal with the scaling problem of volumetric map representations we propose spatially local methods for high-ratio compression of 3D maps, represented as truncated signed distance fields. We show that these compres sed maps can be used as meaningful descriptors for selective decompression in scenarios relevant to robotic applicatio ns. As compression methods, we compare using PCA-derived lowdimensional bases to non-linear auto-encoder networks and novel mixed architectures that combine both. Selecting two application-oriented performance metrics, we evaluate the impact of different compression rates on reconstruction fidel ity as well as to the task of map-aided ego-motion estimation. It is demonstrated that lossily compressed distance fields use d as cost functions for ego-motion estimation, can outperfor m their uncompressed counterparts in challenging scenarios from standard RGB-D data-sets.

[1]  M. Elad,et al.  $rm K$-SVD: An Algorithm for Designing Overcomplete Dictionaries for Sparse Representation , 2006, IEEE Transactions on Signal Processing.

[2]  Jun Zhu,et al.  An Active Patch Model for Real World Texture and Appearance Classification , 2014, ECCV.

[3]  Michael W. Marcellin,et al.  JPEG2000 - image compression fundamentals, standards and practice , 2002, The Kluwer International Series in Engineering and Computer Science.

[4]  Paul Geladi,et al.  Principal Component Analysis , 1987, Comprehensive Chemometrics.

[5]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[6]  Marc Levoy,et al.  A volumetric method for building complex models from range images , 1996, SIGGRAPH.

[7]  Andrew W. Fitzgibbon Robust registration of 2D and 3D point sets , 2003, Image Vis. Comput..

[8]  Alberto Elfes,et al.  Occupancy grids: a probabilistic framework for robot perception and navigation , 1989 .

[9]  Achim J. Lilienthal,et al.  SDF Tracker: A parallel algorithm for on-line pose estimation and scene reconstruction from depth images , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[10]  Daniel Cremers,et al.  Real-Time Camera Tracking and 3D Reconstruction Using Signed Distance Functions , 2013, Robotics: Science and Systems.

[11]  Richard A. Newcombe,et al.  Dense visual SLAM , 2012 .

[12]  Mark W. Jones Distance Field Compression , 2004, WSCG.

[13]  Wolfram Burgard,et al.  A benchmark for the evaluation of RGB-D SLAM systems , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[14]  John J. Leonard,et al.  Kintinuous: Spatially Extended KinectFusion , 2012, AAAI 2012.

[15]  Ming Zeng,et al.  A memory-efficient kinectfusion using octree , 2012, CVM'12.

[16]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[17]  Dinesh Manocha,et al.  Fast computation of generalized Voronoi diagrams using graphics hardware , 1999, SIGGRAPH.

[18]  Wolfram Burgard,et al.  Compact RGBD Surface Models Based on Sparse Coding , 2013, AAAI.

[19]  Thomas G. Dietterich Multiple Classifier Systems , 2000, Lecture Notes in Computer Science.

[20]  Marsette Vona,et al.  Moving Volume KinectFusion , 2012, BMVC.

[21]  Razvan Pascanu,et al.  Pylearn2: a machine learning research library , 2013, ArXiv.

[22]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.

[23]  John C. Hart,et al.  Sphere tracing: a geometric method for the antialiased ray tracing of implicit surfaces , 1996, The Visual Computer.

[24]  Arnulph Fuhrmann,et al.  Distance Fields for Rapid Collision Detection in Physically Based Modeling , 2003 .

[25]  Dieter Fox,et al.  DART: Dense Articulated Real-Time Tracking , 2014, Robotics: Science and Systems.

[26]  Andrew W. Fitzgibbon,et al.  KinectFusion: Real-time dense surface mapping and tracking , 2011, 2011 10th IEEE International Symposium on Mixed and Augmented Reality.

[27]  Ronald N. Perry,et al.  Adaptively sampled distance fields: a general representation of shape for computer graphics , 2000, SIGGRAPH.

[28]  M. Turk,et al.  Eigenfaces for Recognition , 1991, Journal of Cognitive Neuroscience.