Blended Convolution and Synthesis for Efficient Discrimination of 3D Shapes

Existing models for shape analysis directly learn feature representations on 3D point clouds. We argue that 3D point clouds are highly redundant and hold irregular (permutation-invariant) structure, which makes it difficult to achieve inter-class discrimination efficiently. In this paper, we propose a two-pronged solution to this problem that is seamlessly integrated in a single blended convolution and synthesis layer. This fully differentiable layer performs two critical tasks in succession. In the first step, it projects the input 3D point clouds into a latent 3D space to synthesize a highly compact and inter-class discriminative point cloud representation. Since, 3D point clouds do not follow a Euclidean topology, standard 2/3D convolutional neural networks offer limited representation capability. Therefore, in the second step, we propose a novel 3D convolution operator functioning inside the unit ball to extract useful volumetric features. We derive formulae to achieve both translation and rotation of our novel convolution kernels. Finally, using the proposed techniques we present an extremely light-weight, end-to-end architecture that achieves compelling results on 3D shape recognition and retrieval.

[1]  Ming-Kuei Hu,et al.  Visual pattern recognition by moment invariants , 1962, IRE Trans. Inf. Theory.

[2]  Jon Louis Bentley,et al.  Multidimensional binary search trees used for associative searching , 1975, CACM.

[3]  Donald Meagher,et al.  Geometric modeling using octree encoding , 1982, Comput. Graph. Image Process..

[4]  Rama Chellappa,et al.  Classification of Partial 2-D Shapes Using Fourier Descriptors , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Wesley E. Snyder,et al.  Application of Affine-Invariant Fourier Descriptors to Recognition of 3-D Objects , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  T. H. Reiss Features invariant to linear transformations in 2D and 3D , 1992, Proceedings., 11th IAPR International Conference on Pattern Recognition. Vol. III. Conference C: Image, Speech and Signal Analysis,.

[7]  Xuan Guo,et al.  Three-Dimensional Moment Invariants under Rigid Transformation , 1993, CAIP.

[8]  Wageeh W. Boles,et al.  An application of wavelet-based affine-invariant representation , 1995, Pattern Recognit. Lett..

[9]  Jan Flusser,et al.  Vertex-based features for recognition of projectively deformed polygons , 1996, Pattern Recognit..

[10]  Nikolaos Canterakis,et al.  3D Zernike Moments and Zernike Affine Invariants for 3D Image Analysis and Recognition , 1999 .

[11]  Dietmar Saupe,et al.  Tools for 3D-object retrieval: Karhunen-Loeve transform and spherical harmonics , 2001, 2001 IEEE Fourth Workshop on Multimedia Signal Processing (Cat. No.01TH8564).

[12]  Mahmoud I. Khalil,et al.  A Dyadic Wavelet Affine Invariant Function for 2D Shape Recognition , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[13]  Michael Elad,et al.  Content Based Retrieval of VRML Objects - An Iterative and Interactive Approach , 2001, Eurographics Multimedia Workshop.

[14]  Dietmar Saupe,et al.  Description of 3D-shape using a complex function on the sphere , 2002, Proceedings. IEEE International Conference on Multimedia and Expo.

[15]  Jan Flusser,et al.  Moment Forms Invariant to Rotation and Blur in Arbitrary Number of Dimensions , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[16]  Ming Ouhyoung,et al.  On Visual Similarity Based 3D Model Retrieval , 2003, Comput. Graph. Forum.

[17]  Raif M. Rustamov,et al.  Laplace-Beltrami eigenfunctions for deformation invariant shape representation , 2007 .

[18]  Hao Zhang,et al.  A spectral approach to shape-based retrieval of articulated 3D models , 2007, Comput. Aided Des..

[19]  Ioannis Pratikakis,et al.  3D Object Retrieval using an Efficient and Compact Hybrid Shape Descriptor , 2008, 3DOR@Eurographics.

[20]  M. Mintun,et al.  Brain volume decline in aging: evidence for a relation between socioeconomic status, preclinical Alzheimer disease, and reserve. , 2008, Archives of neurology.

[21]  Masaki Aono,et al.  Multi-Fourier spectra descriptor and augmentation with spectral clustering for 3D shape retrieval , 2009, The Visual Computer.

[22]  A. Bronstein,et al.  Shape Google : a computer vision approach to invariant shape retrieval , 2009 .

[23]  Guillermo Sapiro,et al.  A Gromov-Hausdorff Framework with Diffusion Geometry for Topologically-Robust Non-rigid Shape Matching , 2010, International Journal of Computer Vision.

[24]  Ioannis Pratikakis,et al.  Retrieval of 3D Articulated Objects Using a Graph-based Representation , 2009, 3DOR@Eurographics.

[25]  P. Cochat,et al.  Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[26]  Guillaume Lavoué,et al.  Combination of bag-of-words descriptors for robust partial shape retrieval , 2012, The Visual Computer.

[27]  Hamid Laga,et al.  Compact Vectors of Locally Aggregated Tensors for 3D Shape Retrieval , 2013, 3DOR@Eurographics.

[28]  Hamid Laga,et al.  Covariance Descriptors for 3D Shape Matching and Retrieval , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[29]  Alexander M. Bronstein,et al.  Supervised learning of bag‐of‐features shape descriptors using sparse coding , 2014, Comput. Graph. Forum.

[30]  Subhransu Maji,et al.  Multi-view Convolutional Neural Networks for 3D Shape Recognition , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[31]  Sebastian Scherer,et al.  VoxNet: A 3D Convolutional Neural Network for real-time object recognition , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[32]  Mohammed Bennamoun,et al.  A Comprehensive Performance Evaluation of 3D Local Feature Descriptors , 2015, International Journal of Computer Vision.

[33]  Jianxiong Xiao,et al.  3D ShapeNets: A deep representation for volumetric shapes , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[34]  Edward K. Wong,et al.  Deepshape: Deep learned shape descriptor for 3D shape matching and retrieval , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[35]  Zhichao Zhou,et al.  DeepPano: Deep Panoramic Representation for 3-D Shape Recognition , 2015, IEEE Signal Processing Letters.

[36]  Stefan Leutenegger,et al.  Pairwise Decomposition of Image Sequences for Active Multi-view Recognition , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[37]  Leonidas J. Guibas,et al.  FPNN: Field Probing Neural Networks for 3D Data , 2016, NIPS.

[38]  Longin Jan Latecki,et al.  GIFT: A Real-Time and Scalable 3D Shape Search Engine , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[39]  Jiajun Wu,et al.  Learning a Probabilistic Latent Space of Object Shapes via 3D Generative-Adversarial Modeling , 2016, NIPS.

[40]  Meng Wang,et al.  Learned Binary Spectral Shape Descriptor for 3D Shape Correspondence , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[41]  Theodore Lim,et al.  Generative and Discriminative Voxel Modeling with Convolutional Neural Networks , 2016, ArXiv.

[42]  Reza Bosagh Zadeh,et al.  FusionNet: 3D Object Classification Using Multiple Data Representations , 2016, ArXiv.

[43]  Ryutarou Ohbuchi,et al.  Deep Aggregation of Local 3D Geometric Features for 3D Model Retrieval , 2016, BMVC.

[44]  Nikos Komodakis,et al.  Dynamic Edge-Conditioned Filters in Convolutional Neural Networks on Graphs , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[45]  Gernot Riegler,et al.  OctNet: Learning Deep 3D Representations at High Resolutions , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[46]  Leonidas J. Guibas,et al.  PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space , 2017, NIPS.

[47]  Le Song,et al.  Deep Hyperspherical Learning , 2017, NIPS.

[48]  Leonidas J. Guibas,et al.  PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[49]  Anath Fischer,et al.  3D Point Cloud Classification and Segmentation using 3D Modified Fisher Vector Representation for Convolutional Neural Networks , 2017, ArXiv.

[50]  Victor S. Lempitsky,et al.  Escape from Cells: Deep Kd-Networks for the Recognition of 3D Point Cloud Models , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[51]  Yaron Lipman,et al.  Point convolutional neural networks by extension operators , 2018, ACM Trans. Graph..

[52]  Baoquan Chen,et al.  PointCNN , 2018, NIPS 2018.

[53]  Kaleem Siddiqi,et al.  Local Spectral Graph Convolution for Point Set Feature Learning , 2018, ECCV.

[54]  Kostas Daniilidis,et al.  Learning SO(3) Equivariant Representations with Spherical CNNs , 2017, International Journal of Computer Vision.

[55]  Jiaxin Li,et al.  SO-Net: Self-Organizing Network for Point Cloud Analysis , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[56]  Max Welling,et al.  3D Steerable CNNs: Learning Rotationally Equivariant Features in Volumetric Data , 2018, NeurIPS.

[57]  Kaiming He,et al.  Group Normalization , 2018, ECCV.

[58]  Baoquan Chen,et al.  PointCNN: Convolution On $\mathcal{X}$-Transformed Points , 2018 .

[59]  Nick Barnes,et al.  Adversarial Training of Variational Auto-Encoders for High Fidelity Image Generation , 2018, 2018 IEEE Winter Conference on Applications of Computer Vision (WACV).

[60]  Junwei Han,et al.  Deep Spatiality: Unsupervised Learning of Spatially-Enhanced Global and Local 3D Features by Deep Neural Network With Coupled Softmax , 2018, IEEE Transactions on Image Processing.

[61]  Max Welling,et al.  Spherical CNNs , 2018, ICLR.

[62]  Junsong Yuan,et al.  Multi-view Harmonized Bilinear Network for 3D Object Recognition , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[63]  Leonidas J. Guibas,et al.  TextureNet: Consistent Local Parametrizations for Learning From High-Resolution Signals on Meshes , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[64]  Nick Barnes,et al.  Volumetric Convolution: Automatic Representation Learning in Unit Ball , 2019, ArXiv.

[65]  Nick Barnes,et al.  Representation Learning on Unit Ball with 3D Roto-translational Equivariance , 2019, International Journal of Computer Vision.

[66]  Hui Zeng,et al.  Dempster-Shafer evidence theory-based multi-feature learning and fusion method for non-rigid 3D model retrieval , 2019, IET Comput. Vis..

[67]  Shanmuganathan Raman,et al.  LP-3DCNN: Unveiling Local Phase in 3D Convolutional Neural Networks , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[68]  Lars Petersson,et al.  Mitigating the Hubness Problem for Zero-Shot Learning of 3D Objects , 2019, BMVC.

[69]  Matthias Nießner,et al.  Convolutional Neural Networks on non-uniform geometrical signals using Euclidean spectral transformation , 2019, ICLR.

[70]  Lars Petersson,et al.  Zero-shot Learning of 3D Point Cloud Objects , 2019, 2019 16th International Conference on Machine Vision Applications (MVA).

[71]  Yue Wang,et al.  Dynamic Graph CNN for Learning on Point Clouds , 2018, ACM Trans. Graph..

[72]  Lars Petersson,et al.  3DCapsule: Extending the Capsule Architecture to Classify 3D Point Clouds , 2018, 2019 IEEE Winter Conference on Applications of Computer Vision (WACV).

[73]  Zeyun Yu,et al.  An Application of Manifold Learning in Global Shape Descriptors , 2019, Algorithms.

[74]  Lars Petersson,et al.  Transductive Zero-Shot Learning for 3D Point Cloud Classification , 2019, 2020 IEEE Winter Conference on Applications of Computer Vision (WACV).