Fast and Robust Dictionary-based Classification for Image Data

Dictionary-based classification has been promising in knowledge discovery from image data, due to its good performance and interpretable theoretical system. Dictionary learning effectively supports both small- and large-scale datasets, while its robustness and performance depends on the atoms of the dictionary most of the time. Empirically, using a large number of atoms is helpful to obtain a robust classification, while robustness cannot be ensured when setting a small number of atoms. However, learning a huge dictionary dramatically slows down the speed of classification, which is especially worse on the large-scale datasets. To address the problem, we propose a Fast and Robust Dictionary-based Classification (FRDC) framework, which fully utilizes the learned dictionary for classification by staging - and -norms to obtain a robust sparse representation. The new objective function, on the one hand, introduces an additional -norm term upon the conventional -norm optimization, which generates a more robust classification. On the other hand, the optimization based on both - and -norms is solved in two stages, which is much easier and faster than current solutions. In this way, even when using a limited size of dictionary, which makes sure the classification runs very fast, it still can gain higher robustness for multiple types of image data. The optimization is then theoretically analyzed in a new formulation, close but distinct to elastic-net, to prove it is crucial to improve the performance under the premise of robustness. According to our extensive experiments conducted on four image datasets for face and object classification, FRDC keeps generating a robust classification no matter whether using a small or large number of atoms. This guarantees a fast and robust dictionary-based image classification. Furthermore, when simply using deep features extracted via some popular pre-trained neural networks, it outperforms many state-of-the-art methods on the specific datasets.

[1]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[2]  Zhao Zhang,et al.  Joint Subspace Recovery and Enhanced Locality Driven Robust Flexible Discriminative Dictionary Learning , 2019, IEEE Transactions on Circuits and Systems for Video Technology.

[3]  Waheed Uz Zaman Bajwa,et al.  Cloud K-SVD: A Collaborative Dictionary Learning Algorithm for Big, Distributed Data , 2014, IEEE Transactions on Signal Processing.

[4]  Alberto Del Bimbo,et al.  A Dictionary Learning-Based 3D Morphable Shape Model , 2017, IEEE Transactions on Multimedia.

[5]  Larry S. Davis,et al.  Label Consistent K-SVD: Learning a Discriminative Dictionary for Recognition , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Jean Ponce,et al.  Task-Driven Dictionary Learning , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  James Philbin,et al.  FaceNet: A unified embedding for face recognition and clustering , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[8]  Liyi Dai,et al.  Analysis Dictionary Learning Based Classification: Structure for Robustness , 2018, IEEE Transactions on Image Processing.

[9]  Panagiotis Tsakalides,et al.  Snapshot High Dynamic Range Imaging via Sparse Representations and Feature Learning , 2020, IEEE Transactions on Multimedia.

[10]  Allen Y. Yang,et al.  Robust Face Recognition via Sparse Representation , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Fei Zhou,et al.  Consistent Coding Scheme for Single-Image Super-Resolution Via Independent Dictionaries , 2016, IEEE Transactions on Multimedia.

[12]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[13]  Dapeng Tao,et al.  Discriminative Dictionary Learning-Based Multiple Component Decomposition for Detail-Preserving Noisy Image Fusion , 2020, IEEE Transactions on Instrumentation and Measurement.

[14]  Ehud Rivlin,et al.  On the Equivalence of the LC-KSVD and the D-KSVD Algorithms. , 2017, IEEE transactions on pattern analysis and machine intelligence.

[15]  Michael Elad,et al.  Dictionaries for Sparse Representation Modeling , 2010, Proceedings of the IEEE.

[16]  Changsheng Xu,et al.  Learning Consistent Feature Representation for Cross-Modal Multimedia Retrieval , 2015, IEEE Transactions on Multimedia.

[17]  Xiaoqin Zhang,et al.  Self-Taught Semisupervised Dictionary Learning With Nonnegative Constraint , 2020, IEEE Transactions on Industrial Informatics.

[18]  Michael Elad,et al.  Analysis K-SVD: A Dictionary-Learning Algorithm for the Analysis Sparse Model , 2013, IEEE Transactions on Signal Processing.

[19]  Di Guo,et al.  Fast Multiclass Dictionaries Learning With Geometrical Directions in MRI Reconstruction , 2015, IEEE Transactions on Biomedical Engineering.

[20]  Jianping Gou,et al.  Improving sparsity of coefficients for robust sparse and collaborative representation-based image classification , 2017, Neural Computing and Applications.

[21]  M. Elad,et al.  $rm K$-SVD: An Algorithm for Designing Overcomplete Dictionaries for Sparse Representation , 2006, IEEE Transactions on Signal Processing.

[22]  Bob Zhang,et al.  Learning Salient and Discriminative Descriptor for Palmprint Feature Extraction and Identification , 2020, IEEE Transactions on Neural Networks and Learning Systems.

[23]  Yicong Zhou,et al.  Prior Knowledge-Based Probabilistic Collaborative Representation for Visual Recognition , 2020, IEEE Transactions on Cybernetics.

[24]  Baihua Xiao,et al.  Multi-Kernel Coupled Projections for Domain Adaptive Dictionary Learning , 2019, IEEE Transactions on Multimedia.

[25]  Jian-Feng Cai,et al.  Data-driven tight frame construction and image denoising , 2014 .

[26]  Ling Shao,et al.  Discriminative Fisher Embedding Dictionary Learning Algorithm for Object Recognition , 2020, IEEE Transactions on Neural Networks and Learning Systems.

[27]  Jean-Luc Starck,et al.  Sparse Solution of Underdetermined Systems of Linear Equations by Stagewise Orthogonal Matching Pursuit , 2012, IEEE Transactions on Information Theory.

[28]  Ling Shao,et al.  Discriminative Elastic-Net Regularized Linear Regression , 2017, IEEE Transactions on Image Processing.

[29]  Yuan Yan Tang,et al.  Sparse Supervised Representation-Based Classifier for Uncontrolled and Imbalanced Classification , 2020, IEEE Transactions on Neural Networks and Learning Systems.

[30]  Michael Elad,et al.  Theoretical Foundations of Deep Learning via Sparse Representations: A Multilayer Sparse Model and Its Connection to Convolutional Neural Networks , 2018, IEEE Signal Processing Magazine.

[31]  René Vidal,et al.  Structured Low-Rank Matrix Factorization: Global Optimality, Algorithms, and Applications , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[32]  Ehud Rivlin,et al.  On the Equivalence of the LC-KSVD and the D-KSVD Algorithms , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[33]  D. Donoho For most large underdetermined systems of linear equations the minimal 𝓁1‐norm solution is also the sparsest solution , 2006 .

[34]  Roland Vollgraf,et al.  Fashion-MNIST: a Novel Image Dataset for Benchmarking Machine Learning Algorithms , 2017, ArXiv.

[35]  Zhenyu He,et al.  Robust Object Tracking via Key Patch Sparse Representation , 2017, IEEE Transactions on Cybernetics.

[36]  Stefan Wager,et al.  High-Dimensional Asymptotics of Prediction: Ridge Regression and Classification , 2015, 1507.03003.

[37]  Jian Yu,et al.  Group Collaborative Representation for Image Set Classification , 2019, International Journal of Computer Vision.

[38]  Shuang Feng,et al.  Generative and Discriminative Fuzzy Restricted Boltzmann Machine Learning for Text and Image Classification , 2020, IEEE Transactions on Cybernetics.

[39]  Guillermo Sapiro,et al.  Sparse Representation for Computer Vision and Pattern Recognition , 2010, Proceedings of the IEEE.

[40]  Zhiliang Zhu,et al.  Fast Single Image Super-Resolution via Self-Example Learning and Sparse Representation , 2014, IEEE Transactions on Multimedia.

[41]  Guan-Ju Peng,et al.  Adaptive ADMM for Dictionary Learning in Convolutional Sparse Representation , 2019, IEEE Transactions on Image Processing.

[42]  Shuicheng Yan,et al.  Jointly Learning Structured Analysis Discriminative Dictionary and Analysis Multiclass Classifier , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[43]  Jie Qin,et al.  Low-rank analysis-synthesis dictionary learning with adaptively ordinal locality , 2019, Neural Networks.

[44]  Jian Yang,et al.  A Locality-Constrained and Label Embedding Dictionary Learning Algorithm for Image Classification , 2017, IEEE Transactions on Neural Networks and Learning Systems.