K-SVD Based Point Cloud Coding for RGB-D Video Compression Using 3D Super-Point Clustering

In this paper, we present a novel 3D structure-awareness RGB-D video compression scheme, which applies the proposed 3D super-point clustering to partition the super-points in a colored point cloud, generated from an RGB-D image, into a centroid and a non-centroid super-point datasets. A super-point is a set of 3D points which are characterized with similar feature vectors. Input an RGB-D frame to the proposed scheme, the camera parameters are first used to generate a colored point cloud, which is segmented into multiple super-points using our multiple principal plane analysis (MPPA). These super-points are then grouped into multiple clusters, each of them characterized by a centroid super-point. Next, the median feature vectors of super-points are represented by the K singular value decomposition (K-SVD) based sparse codes. Given a super-point cluster, the sparse codes of the median feature vectors are very similar and thus the redundant information among them are easy to remove by the successive entropy coding. For each super-point, the residual super-point is computed by subtracting the feature vectors inside from the reconstructed median feature vector. These residual feature vectors are also collected and coded using the K-SVD based sparse coding to enhance the quality of the compressed point cloud. This process results in a multiple description coding scheme for 3D point cloud compression. Finally, the compressed point cloud is projected to the 2D image space to obtain the compressed RGB-D image. Experiments demonstrate the effectiveness of our approach which attains better performance than the current state-of-the-art point cloud compression methods.

[1]  Xiaoqin Wang,et al.  Fast Depth Video Compression for Mobile RGB-D Sensors , 2016, IEEE Transactions on Circuits and Systems for Video Technology.

[2]  M. Elad,et al.  $rm K$-SVD: An Algorithm for Designing Overcomplete Dictionaries for Sparse Representation , 2006, IEEE Transactions on Signal Processing.

[3]  Li Liu,et al.  KSVD-Based Multiple Description Image Coding , 2019, IEEE Access.

[4]  Erik Schaffernicht,et al.  Compressed Voxel-Based Mapping Using Unsupervised Learning , 2017, Robotics.

[5]  Chin-Chun Chang,et al.  Deformable shape finding with models based on kernel methods , 2006, IEEE Transactions on Image Processing.

[6]  Shyi-Chyi Cheng,et al.  Model-Based 3D Pose Estimation of a Single RGB Image Using a Deep Viewpoint Classification Neural Network , 2019, Applied Sciences.

[7]  Matthias Nießner,et al.  State of the Art on 3D Reconstruction with RGB‐D Cameras , 2018, Comput. Graph. Forum.

[8]  Jeffrey Scott Vitter,et al.  Arithmetic coding for data compression , 1994 .

[9]  Ming Zeng,et al.  A memory-efficient kinectfusion using octree , 2012, CVM'12.

[10]  Shyi-Chyi Cheng,et al.  A novel 3D mesh compression using mesh segmentation with multiple principal plane analysis , 2010, Pattern Recognit..

[11]  Pascal Frossard,et al.  Graph-Based Compression of Dynamic 3D Point Cloud Sequences , 2015, IEEE Transactions on Image Processing.

[12]  Marcus A. Magnor,et al.  RGB-guided depth map compression via Compressed Sensing and Sparse Coding , 2015, 2015 Picture Coding Symposium (PCS).

[13]  Rufael Mekuria,et al.  Design, Implementation, and Evaluation of a Point Cloud Codec for Tele-Immersive Video , 2017, IEEE Transactions on Circuits and Systems for Video Technology.

[14]  Yuxiang Sun,et al.  A Novel Point Cloud Compression Algorithm Based on Clustering , 2019, IEEE Robotics and Automation Letters.

[15]  Ling Shao,et al.  Enhanced Computer Vision With Microsoft Kinect Sensor: A Review , 2013, IEEE Transactions on Cybernetics.

[16]  A. Wyner,et al.  Source coding for multiple descriptions , 1980, The Bell System Technical Journal.

[17]  Hanjun Kim,et al.  Robust 3-D Object Reconstruction Based on Camera Clustering With Geodesic Distance , 2019, IEEE/ASME Transactions on Mechatronics.

[18]  Yao Zhao,et al.  Multiple Description Coding With Randomly and Uniformly Offset Quantizers , 2014, IEEE Transactions on Image Processing.