Support Vector Machines: A Recent Method for Classification in Chemometrics

Support Vector Machines (SVMs) are a new generation of classification method. Derived from well principled Statistical Learning theory, this method attempts to produce boundaries between classes by both minimising the empirical error from the training set and also controlling the complexity of the decision boundary, which can be non-linear. SVMs use a kernel matrix to transform a non-linear separation problem in input space to a linear separation problem in feature space. Common kernels include the Radial Basis Function, Polynomial and Sigmoidal Functions. In many simulated studies and real applications, SVMs show superior generalisation performance compared to traditional classification methods. SVMs also provide several useful statistics that can be used for both model selection and feature selection because these statistics are the upper bounds of the generalisation performance estimation of Leave-One-Out Cross-Validation. SVMs can be employed for multiclass problems in addition to the traditional two class application. Various approaches include one-class classifiers, one-against-one, one-against-all and DAG (Directed Acyclic Graph) trees. Methods for feature selection include RFE (Recursive Feature Elimination) and Gradient Descent based approaches.

[1]  B. Walczak,et al.  A modification of the ICOMP criterion for estimation of optimum complexity of PCR models , 2005 .

[2]  David G. Stork,et al.  Pattern classification, 2nd Edition , 2000 .

[3]  Francis Eng Hock Tay,et al.  Modified support vector machines in financial time series forecasting , 2002, Neurocomputing.

[4]  David G. Stork,et al.  Pattern Classification , 1973 .

[5]  Vladimir Cherkassky,et al.  The Nature Of Statistical Learning Theory , 1997, IEEE Trans. Neural Networks.

[6]  Antonio J. Serrano,et al.  Profiled support vector machines for antisense oligonucleotide efficacy prediction , 2004, BMC Bioinformatics.

[7]  Chih-Jen Lin,et al.  A comparison of methods for multiclass support vector machines , 2002, IEEE Trans. Neural Networks.

[8]  Pietro Siciliano,et al.  ANALYSIS OF PEACHES RIPENESS BY AN ELECTRONIC NOSE AND NEAR-INFRARED SPECTROSCOPY , 2002 .

[9]  Melissa Hanna-Brown,et al.  Toxicological classification of urine samples using pattern recognition techniques and capillary electrophoresis , 2004, Analytical and bioanalytical chemistry.

[10]  Michael I. Jordan,et al.  Advances in Neural Information Processing Systems 30 , 1995 .

[11]  Sayan Mukherjee,et al.  Choosing Multiple Parameters for Support Vector Machines , 2002, Machine Learning.

[12]  N. Ancona,et al.  Support vector machines for olfactory signals recognition , 2003 .

[13]  Guodong Guo,et al.  Face recognition by support vector machines , 2000, Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580).

[14]  Vincent Baeten,et al.  Combination of support vector machines (SVM) and near‐infrared (NIR) imaging spectroscopy for the detection of meat and bone meal (MBM) in compound feeds , 2004 .

[15]  Alain Rakotomamonjy,et al.  Variable Selection Using SVM-based Criteria , 2003, J. Mach. Learn. Res..

[16]  L. Buydens,et al.  Comparing support vector machines to PLS for spectral regression applications , 2004 .

[17]  P Dardenne,et al.  Classification of modified starches by fourier transform infrared spectroscopy using support vector machines. , 2005, Journal of agricultural and food chemistry.

[18]  Bernd Beck,et al.  A support vector machine approach to classify human cytochrome P450 3A4 inhibitors , 2005, J. Comput. Aided Mol. Des..

[19]  Chao He,et al.  Employing optimized combinations of one-class classifiers for automated currency validation , 2004, Pattern Recognit..

[20]  E. Marchiori,et al.  Sample handling for mass spectrometric proteomic investigations of human sera. , 2005, Analytical chemistry.

[21]  Nello Cristianini,et al.  Support vector machine classification and validation of cancer tissue samples using microarray expression data , 2000, Bioinform..

[22]  Melanie Hilario,et al.  Mining mass spectra for diagnosis and biomarker discovery of cerebral accidents , 2004, Proteomics.

[23]  B. Walczak,et al.  About kernel latent variable approaches and SVM , 2005 .

[24]  Richard G. Brereton,et al.  Support vector machines for the discrimination of analytical chemical data: application to the determination of tablet production by pyrolysis-gas chromatography-mass spectrometry , 2004 .

[25]  Robert P. W. Duin,et al.  Support vector domain description , 1999, Pattern Recognit. Lett..

[26]  Yun Xu,et al.  Diagnostic Pattern Recognition on Gene-Expression Profile Data by Using One-Class Classification , 2005, J. Chem. Inf. Model..

[27]  L. Buydens,et al.  Facilitating the application of Support Vector Regression by using a universal Pearson VII function based kernel , 2006 .

[28]  David M. J. Tax,et al.  One-class classification , 2001 .

[29]  B. Schölkopf,et al.  Advances in kernel methods: support vector learning , 1999 .

[30]  R. Brereton,et al.  Active learning support vector machines for optimal sample selection in classification , 2004 .

[31]  Malik Yousef,et al.  One-Class SVMs for Document Classification , 2002, J. Mach. Learn. Res..

[32]  V. Vapnik,et al.  Bounds on Error Expectation for Support Vector Machines , 2000, Neural Computation.

[33]  A. Belousov,et al.  Applicational aspects of support vector machines , 2002 .

[34]  A. Belousov,et al.  A flexible classification approach with optimal generalisation performance: support vector machines , 2002 .

[35]  Tomasz Markiewicz,et al.  Classification of milk by means of an electronic nose and SVM neural network , 2004 .

[36]  L. Buydens,et al.  Multivariate calibration with least-squares support vector machines. , 2004, Analytical chemistry.

[37]  Giovanni Luca Christian Masala,et al.  A comparative study of K-Nearest Neighbour, Support Vector Machine and Multi-Layer Perceptron for Thalassemia screening , 2003 .

[38]  Guodong Guo,et al.  Support vector machines for face recognition , 2001, Image Vis. Comput..