Neural 3D Scene Compression via Model Compression

Rendering 3D scenes requires access to arbitrary viewpoints from the scene. Storage of such a 3D scene can be done in two ways; (1) storing 2D images taken from the 3D scene that can reconstruct the scene back through interpolations, or (2) storing a representation of the 3D scene itself that already encodes views from all directions. So far, traditional 3D compression methods have focused on the first type of storage and compressed the original 2D images with image compression techniques. With this approach, the user first decodes the stored 2D images and then renders the 3D scene. However, this separated procedure is inefficient since a large amount of 2D images have to be stored. In this work, we take a different approach and compress a functional representation of 3D scenes. In particular, we introduce a method to compress 3D scenes by compressing the neural networks that represent the scenes as neural radiance fields. Our method provides more efficient storage of 3D scenes since it does not store 2D images – which are redundant when we render the scene from the neural functional representation.1

[1]  Ulrich Neumann,et al.  Compression of computer graphics images with image-based rendering , 1998, Electronic Imaging.

[2]  Ioan Tabus,et al.  WaSP: Hierarchical Warping, Merging, and Sparse Prediction for Light Field Image Compression , 2018, 2018 7th European Workshop on Visual Information Processing (EUVIP).

[3]  Pratul P. Srinivasan,et al.  NeRF , 2020, ECCV.

[4]  Andreas Nüchter,et al.  3D point cloud compression using conventional image compression for efficient data transmission , 2015, 2015 XXV International Conference on Information, Communication and Automation Technologies (ICAT).

[5]  Richard A. Newcombe,et al.  DeepSDF: Learning Continuous Signed Distance Functions for Shape Representation , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  Tsachy Weissman,et al.  Neural Network Compression for Noisy Storage Devices , 2021, ArXiv.

[7]  Xinfeng Zhang,et al.  Light Field Image Compression Using Generative Adversarial Network-Based View Synthesis , 2019, IEEE Journal on Emerging and Selected Topics in Circuits and Systems.

[8]  Petros Daras,et al.  On human Time-Varying Mesh compression exploiting activity-related characteristics , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[9]  Tao Zhang,et al.  A Survey of Model Compression and Acceleration for Deep Neural Networks , 2017, ArXiv.

[10]  Tsachy Weissman,et al.  Successive Pruning for Model Compression via Rate Distortion Theory , 2021, ArXiv.

[11]  Rufael Mekuria,et al.  Emerging MPEG Standards for Point Cloud Compression , 2019, IEEE Journal on Emerging and Selected Topics in Circuits and Systems.

[12]  Gordon Wetzstein,et al.  State of the Art on Neural Rendering , 2020, Comput. Graph. Forum.

[13]  Silvio Savarese,et al.  3D-R2N2: A Unified Approach for Single and Multi-view 3D Object Reconstruction , 2016, ECCV.

[14]  Erich Elsen,et al.  The State of Sparsity in Deep Neural Networks , 2019, ArXiv.

[15]  Sebastian Nowozin,et al.  Occupancy Networks: Learning 3D Reconstruction in Function Space , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[16]  Dmitry P. Vetrov,et al.  Variational Dropout Sparsifies Deep Neural Networks , 2017, ICML.

[17]  Davis Pan,et al.  A Tutorial on MPEG/Audio Compression , 1995, IEEE Multim..

[18]  Craig Gotsman,et al.  Triangle Mesh Compression , 1998, Graphics Interface.

[19]  Jonathan T. Barron,et al.  Fourier Features Let Networks Learn High Frequency Functions in Low Dimensional Domains , 2020, NeurIPS.

[20]  Charles T. Loop,et al.  Neural Geometric Level of Detail: Real-time Rendering with Implicit 3D Shapes , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[21]  Jan Platos,et al.  Word-Based Text Compression , 2008, ArXiv.

[22]  Maja Krivokuca,et al.  A Volumetric Approach to Point Cloud Compression—Part I: Attribute Compression , 2020, IEEE Transactions on Image Processing.

[23]  Yoav Y. Schechner,et al.  Image compression optimized for 3D reconstruction by utilizing deep neural networks , 2020, J. Vis. Commun. Image Represent..

[24]  Adrian S. Lewis,et al.  Image compression using the 2-D wavelet transform , 1992, IEEE Trans. Image Process..

[25]  Mohammed M. Siddeq,et al.  A Novel Image Compression Algorithm for High Resolution 3D Reconstruction , 2014 .

[26]  Michael W. Marcellin,et al.  JPEG2000 - image compression fundamentals, standards and practice , 2013, The Kluwer international series in engineering and computer science.

[27]  Song Han,et al.  AMC: AutoML for Model Compression and Acceleration on Mobile Devices , 2018, ECCV.

[28]  Song Han,et al.  Deep Compression: Compressing Deep Neural Network with Pruning, Trained Quantization and Huffman Coding , 2015, ICLR.

[29]  Song Han,et al.  Learning both Weights and Connections for Efficient Neural Network , 2015, NIPS.

[30]  Simon Fuhrmann,et al.  MVE - A Multi-View Reconstruction Environment , 2014, GCH.

[31]  Maja Krivokuca,et al.  A Volumetric Approach to Point Cloud Compression–Part II: Geometry Compression , 2019, IEEE Transactions on Image Processing.

[32]  Xinfeng Zhang,et al.  Light Field Image Compression Based on Deep Learning , 2018, 2018 IEEE International Conference on Multimedia and Expo (ICME).

[33]  Jarek Rossignac,et al.  Edgebreaker: Connectivity Compression for Triangle Meshes , 1999, IEEE Trans. Vis. Comput. Graph..

[34]  C.-C. Jay Kuo,et al.  Technologies for 3D mesh compression: A survey , 2005, J. Vis. Commun. Image Represent..

[35]  Nico Blodow,et al.  Real-time compression of point cloud streams , 2012, 2012 IEEE International Conference on Robotics and Automation.

[36]  Konstantinos Konstantinides,et al.  Image and Video Compression Standards: Algorithms and Architectures , 1997 .

[37]  Gordon Wetzstein,et al.  Implicit Neural Representations with Periodic Activation Functions , 2020, NeurIPS.

[38]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[39]  Marc'Aurelio Ranzato,et al.  Large Scale Distributed Deep Networks , 2012, NIPS.