One-Class Kernel Spectral Regression for Outlier Detection

The paper introduces a new, efficient nonlinear one-class classifier formulated as the optimisation of a Rayleigh quotient criterion. The method, operating in a reproducing kernel Hilbert subspace, minimises the scatter of the target distribution along an optimal projection direction while keeping the projections of positive observations distant from the mean of the negative class. We provide a graph embedding view of the problem, which can then be solved efficiently using the spectral regression approach. Unlike previous similar methods, which often require costly eigen-computations on dense matrices, the proposed approach casts the problem into a regression framework that is computationally more efficient. In particular, it is shown that the dominant complexity of the proposed method is that of computing the kernel matrix. Additional appealing characteristics of the proposed one-class classifier are: (1) the ability to be trained incrementally, which allows for application in streaming data scenarios while also reducing the computational complexity in a non-streaming operation mode; (2) being unsupervised, while providing the option of refining the solution using negative training examples when available; and (3) the use of the kernel trick, which facilitates a nonlinear mapping of the data into a high-dimensional feature space to seek better solutions.
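To make the regression view concrete, the following is a minimal sketch of a one-class kernel regression of the kind the abstract describes, not the authors' implementation. It assumes a Gaussian (RBF) kernel, an all-ones response vector for the target samples, a ridge parameter `delta`, and a batch Cholesky solve; none of these choices is stated in the abstract, and all function names are hypothetical. The incremental training scheme is not reproduced here.

```python
import numpy as np

def rbf_kernel(A, B, gamma=0.5):
    # Pairwise Gaussian kernel values between rows of A and rows of B.
    sq = (A**2).sum(1)[:, None] + (B**2).sum(1)[None, :] - 2.0 * A @ B.T
    return np.exp(-gamma * np.maximum(sq, 0.0))

def fit_one_class_regression(X, gamma=0.5, delta=1e-2, target=1.0):
    # Assumption: every target sample is regressed onto the same response value.
    n = X.shape[0]
    K = rbf_kernel(X, X, gamma)
    y = np.full(n, target)
    # Solve (K + delta * I) alpha = y through a Cholesky factorisation,
    # so no eigen-decomposition of the dense kernel matrix is needed.
    L = np.linalg.cholesky(K + delta * np.eye(n))
    alpha = np.linalg.solve(L.T, np.linalg.solve(L, y))
    return alpha

def outlier_score(Z, X, alpha, gamma=0.5, target=1.0):
    # Distance of each projection from the common response value;
    # larger scores suggest the sample is less like the target class.
    return np.abs(rbf_kernel(Z, X, gamma) @ alpha - target)

# Toy data (fabricated for illustration only): a tight target cluster
# and one point far away from it.
rng = np.random.default_rng(0)
X_train = 0.3 * rng.standard_normal((50, 2))
alpha = fit_one_class_regression(X_train)
scores_train = outlier_score(X_train, X_train, alpha)
scores_far = outlier_score(np.array([[5.0, 5.0]]), X_train, alpha)
```

In this sketch a point far from the training cluster has near-zero kernel values, so its projection falls close to zero and its score approaches the response value, whereas training samples project close to the response value and score low.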
