A deep perceptual metric for 3D point clouds

Point clouds are essential for storage and transmission of 3D content. As they can entail significant volumes of data, point cloud compression is crucial for practical usage. Recently, point cloud geometry compression approaches based on deep neural networks have been explored. In this paper, we evaluate the ability to predict perceptual quality of typical voxel-based loss functions employed to train these networks. We find that the commonly used focal loss and weighted binary cross entropy are poorly correlated with human perception. We thus propose a perceptual loss function for 3D point clouds which outperforms existing loss functions on the ICIP2020 subjective dataset. In addition, we propose a novel truncated distance field voxel grid representation and find that it leads to sparser latent spaces and loss functions that are more correlated with perceived visual quality compared to a binary representation. The source code is available at https://github.com/mauriceqch/2021_pc_perceptual_loss.

[1]  Martín Abadi,et al.  TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems , 2016, ArXiv.

[2]  Giuseppe Valenzise,et al.  Learning Convolutional Transforms for Lossy Point Cloud Geometry Compression , 2019, 2019 IEEE International Conference on Image Processing (ICIP).

[3]  Raquel Urtasun,et al.  MuSCLE: Multi Sweep Compression of LiDAR using Deep Entropy Models , 2020, NeurIPS.

[4]  Fernando Pereira,et al.  Neighborhood Adaptive Loss Function for Deep Learning-Based Point Cloud Coding With Implicit and Explicit Quantization , 2021, IEEE MultiMedia.

[5]  Guillaume Lavoué,et al.  PC-MSDM: A quality metric for 3D point clouds , 2019, 2019 Eleventh International Conference on Quality of Multimedia Experience (QoMEX).

[6]  Frederic Dufaux,et al.  Improved Deep Point Cloud Geometry Compression , 2020, 2020 IEEE 22nd International Workshop on Multimedia Signal Processing (MMSP).

[7]  Ross B. Girshick,et al.  Focal Loss for Dense Object Detection , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  G. Zou Toward using confidence intervals to compare correlations. , 2007, Psychological methods.

[9]  Catarina Brites,et al.  Mahalanobis Based Point to Distribution Metric for Point Cloud Geometry Quality Evaluation , 2020, IEEE Signal Processing Letters.

[10]  Touradj Ebrahimi,et al.  Point Cloud Quality Assessment Metric Based on Angular Similarity , 2018, 2018 IEEE International Conference on Multimedia and Expo (ICME).

[11]  Shishir Subramanyam,et al.  A Color-Based Objective Quality Metric for Point Cloud Contents , 2020, 2020 Twelfth International Conference on Quality of Multimedia Experience (QoMEX).

[12]  Yinda Zhang,et al.  Deep Implicit Volume Compression , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[13]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[14]  Alexei A. Efros,et al.  The Unreasonable Effectiveness of Deep Features as a Perceptual Metric , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[15]  Catarina Brites,et al.  Improving Psnr-Based Quality Metrics Performance For Point Cloud Geometry , 2020, 2020 IEEE International Conference on Image Processing (ICIP).

[16]  Touradj Ebrahimi,et al.  Towards neural network approaches for point cloud compression , 2020, Optical Engineering + Applications.

[17]  Rufael Mekuria,et al.  Emerging MPEG Standards for Point Cloud Compression , 2019, IEEE Journal on Emerging and Selected Topics in Circuits and Systems.

[18]  R. Urtasun,et al.  OctSqueeze: Octree-Structured Entropy Model for LiDAR Compression , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[19]  Thomas Brox,et al.  Orientation-boosted Voxel Nets for 3D Object Recognition , 2016, BMVC.

[20]  Guillaume Lavoué,et al.  PCQM: A Full-Reference Quality Metric for Colored 3D Point Clouds , 2020, 2020 Twelfth International Conference on Quality of Multimedia Experience (QoMEX).

[21]  Learned Point Cloud Geometry Compression , 2019, ArXiv.

[22]  Dong Tian,et al.  Geometric distortion metrics for point cloud compression , 2017, 2017 IEEE International Conference on Image Processing (ICIP).

[23]  Geoffrey E. Hinton,et al.  Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[24]  Touradj Ebrahimi,et al.  JPEG Pleno: Toward an Efficient Representation of Visual Reality , 2016, IEEE MultiMedia.

[25]  Marc Levoy,et al.  A volumetric method for building complex models from range images , 1996, SIGGRAPH.

[26]  Zhan Ma,et al.  Multiscale Point Cloud Geometry Compression , 2020, 2021 Data Compression Conference (DCC).

[27]  Simone Milani A Syndrome-Based Autoencoder For Point Cloud Geometry Compression , 2020, 2020 IEEE International Conference on Image Processing (ICIP).

[28]  Touradj Ebrahimi,et al.  Quality Evaluation Of Static Point Clouds Encoded Using MPEG Codecs , 2020, 2020 IEEE International Conference on Image Processing (ICIP).

[29]  Touradj Ebrahimi,et al.  Towards a Point Cloud Structural Similarity Metric , 2020, 2020 IEEE International Conference on Multimedia & Expo Workshops (ICMEW).