One‐class classifiers

The principles of one‐class classifiers are introduced, together with the distinctions between one‐class/multiclass, soft/hard, conjoint/disjoint and modelling/discriminatory methods. The methods are illustrated using case studies, namely from nuclear magnetic resonance metabolomic profiling, thermal analysis of polymers and simulations. Two main groups of classifier are described, namely statistically based distance metrics from centroids (Euclidean distance and quadratic discriminant analysis) and support vector domain description (SVDD). The statistical basis of the D statistic and its relationship with the F statistic, χ2, normal distribution and T2 is discussed. The SVDD D value is described. Methods for assessing the distance of residuals to disjoint principal component models (Q statistic) and their combination with distance‐based methods to give the G statistic are outlined. Copyright © 2011 John Wiley & Sons, Ltd.

[1]  Chandan Srivastava,et al.  Support Vector Data Description , 2011 .

[2]  R. Brereton,et al.  Disjoint hard models for classification , 2010 .

[3]  R. Brereton,et al.  One class classifiers for process monitoring illustrated by the application to online HPLC of a continuous process , 2010 .

[4]  R. Brereton,et al.  Support vector machines for classification and regression. , 2010, The Analyst.

[5]  R. Brereton,et al.  Supervised self organizing maps for classification and determination of potentially discriminatory variables: illustrated by application to nuclear magnetic resonance metabolomic profiling. , 2010, Analytical chemistry.

[6]  Martin Grootveld,et al.  Self Organising Maps for variable selection: Application to human saliva analysed by nuclear magnetic resonance spectroscopy to investigate the effect of an oral healthcare product , 2009 .

[7]  Richard G. Brereton,et al.  Chemometrics for Pattern Recognition , 2009 .

[8]  R. Brereton,et al.  Comparison of performance of five common classifiers represented as boundary methods: Euclidean Distance to Centroids, Linear Discriminant Analysis, Quadratic Discriminant Analysis, Learning Vector Quantization and Support Vector Machines, as dependent on data structure , 2009 .

[9]  Saiful Islam,et al.  Mahalanobis Distance , 2009, Encyclopedia of Biometrics.

[10]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[11]  A. Pomerantsev Acceptance areas for multivariate classification derived by projection methods , 2008 .

[12]  Richard G Brereton,et al.  Self Organising Maps for distinguishing polymer groups using thermal response curves obtained by dynamic mechanical analysis. , 2008, The Analyst.

[13]  K. Héberger,et al.  Supervised pattern recognition in food analysis. , 2007, Journal of chromatography. A.

[14]  Richard G. Brereton,et al.  Learning Vector Quantization for Multiclass Classification: Application to Characterization of Plastics , 2007, J. Chem. Inf. Model..

[15]  Richard G. Brereton,et al.  Pattern recognition and feature selection for the discrimination between grades of commercial plastics , 2007 .

[16]  R. Brereton,et al.  Dynamic mechanical analysis and chemometrics for polymer identification , 2007 .

[17]  M. Kenward,et al.  An Introduction to the Bootstrap , 2007 .

[18]  R. Brereton,et al.  Pattern recognition for the analysis of polymeric materials. , 2006, The Analyst.

[19]  Yun Xu,et al.  Support Vector Machines: A Recent Method for Classification in Chemometrics , 2006 .

[20]  Charles E. Heckler,et al.  Applied Multivariate Statistical Analysis , 2005, Technometrics.

[21]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[22]  Robert P. W. Duin,et al.  Support Vector Data Description , 2004, Machine Learning.

[23]  S. Joe Qin,et al.  Statistical process monitoring: basics and beyond , 2003 .

[24]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[25]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[26]  Robert P. W. Duin,et al.  Support vector domain description , 1999, Pattern Recognit. Lett..

[27]  Tordis E. Morud,et al.  Multivariate statistical process control; example from the chemical process industry , 1996 .

[28]  John F. MacGregor,et al.  Multivariate SPC charts for monitoring batch processes , 1995 .

[29]  J. Edward Jackson,et al.  A User's Guide to Principal Components. , 1991 .

[30]  J. Friedman Regularized Discriminant Analysis , 1989 .

[31]  B. Manly Multivariate Statistical Methods : A Primer , 1986 .

[32]  D. Massart,et al.  UNEQ: a disjoint modelling technique for pattern recognition based on normal distribution , 1986 .

[33]  J. E. Jackson,et al.  Control Procedures for Residuals Associated With Principal Component Analysis , 1979 .

[34]  S. Wold Cross-Validatory Estimation of the Number of Components in Factor and Principal Components Models , 1978 .

[35]  Svante Wold,et al.  Pattern recognition by means of disjoint principal components models , 1976, Pattern Recognit..

[36]  G. Box Some Theorems on Quadratic Forms Applied in the Study of Analysis of Variance Problems, I. Effect of Inequality of Variance in the One-Way Classification , 1954 .

[37]  R. Fisher THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS , 1936 .