MaskLayer: Enabling scalable deep learning solutions by training embedded feature sets

Deep learning-based methods have been shown to achieve excellent results in a variety of domains; however, some important properties are still missing. Quality scalability is one of them. In this work, we introduce a novel and generic neural network layer, named MaskLayer. It can be integrated into any feedforward network and provides quality scalability by design by creating embedded feature sets. These are obtained by imposing a specific structure on the feature vector during training. To further improve performance, a masked optimizer and a balancing gradient rescaling approach are proposed. Our experiments show that the cost of introducing scalability using MaskLayer remains limited. To demonstrate its generality and applicability, we integrated the proposed techniques into existing, non-scalable networks for point cloud compression and semantic hashing, with excellent results. To the best of our knowledge, this is the first work presenting a generic solution that achieves quality-scalable results within the deep learning framework.
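
The idea of an embedded feature set can be illustrated with a minimal sketch. It assumes that the "specific structure" amounts to keeping only a leading prefix of the feature vector and zeroing the rest, so that every shorter prefix is contained in every longer one. The abstract does not specify the masking rule, and the names `prefix_mask` and `mask_layer`, as well as the uniform sampling of the prefix length during training, are hypothetical choices made for illustration only. The masked optimizer and the gradient rescaling are not covered.

```python
import random


def prefix_mask(length, keep):
    """Return a 0/1 mask that keeps the first `keep` entries of a vector."""
    if not 0 <= keep <= length:
        raise ValueError("keep must be between 0 and length")
    return [1.0] * keep + [0.0] * (length - keep)


def mask_layer(features, keep=None, rng=random):
    """Zero out the tail of `features`, leaving a prefix of length `keep`.

    If `keep` is None (standing in for training mode in this sketch), the
    prefix length is sampled uniformly from 1..len(features). This sampling
    rule is an assumption, not taken from the paper.
    """
    n = len(features)
    if keep is None:
        keep = rng.randint(1, n)
    mask = prefix_mask(n, keep)
    return [f * m for f, m in zip(features, mask)]


features = [0.9, -0.4, 0.3, 0.1]

# Inference at two quality levels: the coarse features are a prefix of the
# fine ones, which is what makes the feature sets "embedded".
coarse = mask_layer(features, keep=2)
fine = mask_layer(features, keep=4)
```

Under this assumption, a decoder that receives only the first `keep` features can still produce a lower-quality output, and each additional feature refines it.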
