Cross-Entropy Clustering Approach to One-Class Classification

Cross-entropy clustering (CEC) is a density model based clustering algorithm. In this paper we apply CEC to the one-class classification, which has several advantages over classical approaches based on Expectation Maximization (EM) and Support Vector Machines (SVM). More precisely, our model allows the use of various types of gaussian models with low computational complexity. We test the designed method on real data coming from the monitoring systems of wind turbines.

[1]  Victoria J. Hodge,et al.  A Survey of Outlier Detection Methodologies , 2004, Artificial Intelligence Review.

[2]  Gérard Govaert,et al.  Gaussian parsimonious clustering models , 1995, Pattern Recognit..

[3]  Jacek Tabor,et al.  Detection of Disk-Like Particles in Electron Microscopy Images , 2013, CORES.

[4]  Sameer Singh,et al.  Novelty detection: a review - part 1: statistical approaches , 2003, Signal Process..

[5]  Jacek Tabor,et al.  Detection of elliptical shapes via cross-entropy clustering , 2013, IbPRIA.

[6]  Stefanie Nowak,et al.  Using one-class SVM outliers detection for verification of collaboratively tagged image training sets , 2009, 2009 IEEE International Conference on Multimedia and Expo.

[7]  Christopher M. Bishop,et al.  Novelty detection and neural network validation , 1994 .

[8]  A. Bielecki,et al.  Wind speed modelling using Weierstrass function fitted by a genetic algorithm , 2012 .

[9]  B. Silverman Density estimation for statistics and data analysis , 1986 .

[10]  A. Bowman,et al.  A look at some data on the old faithful geyser , 1990 .

[11]  Robert P. W. Duin,et al.  Outlier Detection Using Classifier Instability , 1998, SSPR/SPR.

[12]  Tomasz Barszcz,et al.  ART-2 Artificial Neural Networks Applications for Classification of Vibration Signals and Operational States of Wind Turbines for Intelligent Monitoring , 2014 .

[13]  Jürgen Bajorath,et al.  Virtual screening methods that complement HTS. , 2004, Combinatorial chemistry & high throughput screening.

[14]  Clintin Davis-Stober,et al.  Exploratory data analysis with MATLAB. , 2007 .

[15]  Bernhard Schölkopf,et al.  Estimating the Support of a High-Dimensional Distribution , 2001, Neural Computation.

[16]  Tomasz Barszcz,et al.  ART-Type Artificial Neural Networks Applications for Classification of Operational States in Wind Turbines , 2010, ICAISC.

[17]  Tomasz Barszcz,et al.  Modelling of a chaotic load of wind turbines drivetrain , 2015 .

[18]  Osman F Güner,et al.  History and evolution of the pharmacophore concept in computer-aided drug design. , 2002, Current topics in medicinal chemistry.

[19]  Tomasz Barszcz,et al.  Wind Turbines States Classification by a Fuzzy-ART Neural Network with a Stereographic Projection as a Signal Normalization , 2011, ICANNGA.

[20]  Jacek Tabor,et al.  Asymmetric Clustering Index in a Case Study of 5-HT1A Receptor Ligands , 2014, PloS one.

[21]  Tomasz Barszcz,et al.  Hybrid System of ART and RBF Neural Networks for Classification of Vibration Signals and Operational States of Wind Turbines , 2014, ICAISC.

[22]  Robert P. W. Duin,et al.  Support Vector Data Description , 2004, Machine Learning.

[23]  Lionel Tarassenko,et al.  Novelty detection for the identification of abnormalities , 2000, Int. J. Syst. Sci..

[24]  Vic Barnett,et al.  Outliers in Statistical Data , 1980 .

[25]  J Tabor,et al.  Cross-entropy clustering , 2012, Pattern Recognit..

[26]  C. D. Kemp,et al.  Density Estimation for Statistics and Data Analysis , 1987 .

[27]  A. Raftery,et al.  Model-based Gaussian and non-Gaussian clustering , 1993 .