A Study on the Influence of Shape in Classifying Small Spectral Data Sets

Classification of spectral data has raised a growing interest in may research areas. However, this type of data usually suffers from the curse of dimensionality. This causes most statistical methods and/or classifiers to not perform well. A recently proposed alternative which can help avoiding this problem is the Dissimilarity Representation, in which objects are represented by their dissimilarities to representative objects of each class. However, this approach depends on the selection of a suitable dissimilarity measure. For spectra, the incorporation of information on their shape, can be significant for a good discrimination. In this paper, we make a study on the benefit of using a measure which takes shape of spectra into account. We show that the shape-based measure not only leads to better classification results, but that a certain number of objects is enough to achieve it. The experiments are conducted on three onedimensional data sets and a two-dimensional one.

[1]  Keinosuke Fukunaga,et al.  Effects of Sample Size in Classifier Design , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Anil K. Jain,et al.  Small Sample Size Effects in Statistical Pattern Recognition: Recommendations for Practitioners , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Robert P. W. Duin,et al.  Classifiers in almost empty spaces , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[4]  Jian Yang,et al.  From image vector to matrix: a straightforward image projection technique - IMPCA vs. PCA , 2002, Pattern Recognit..

[5]  Robert P. W. Duin,et al.  Dissimilarity-based classification of spectra: computational issues , 2003, Real Time Imaging.

[6]  Alejandro F. Frangi,et al.  Two-dimensional PCA: a new approach to appearance-based face representation and recognition , 2004 .

[7]  Robert P. W. Duin,et al.  The Dissimilarity Representation for Pattern Recognition - Foundations and Applications , 2005, Series in Machine Perception and Artificial Intelligence.

[8]  Robert P. W. Duin,et al.  DISSIMILARITY-BASED CLASSIFICATION OF SEISMIC SIGNALS AT NEVADO DEL RUIZ VOLCANO , 2006 .

[9]  David Zhang,et al.  An assembled matrix distance metric for 2DPCA-based image recognition , 2006, Pattern Recognit. Lett..

[10]  R. Bro,et al.  Multiblock variance partitioning: a new approach for comparing variation in multiple data blocks. , 2008, Analytica chimica acta.

[11]  Robert P. W. Duin,et al.  The Representation of Chemical Spectral Data for Classification , 2009, CIARP.

[12]  R. Duin,et al.  The dissimilarity representation for pattern recognition , a tutorial , 2009 .

[13]  Edwin R. Hancock,et al.  Structural, Syntactic, and Statistical Pattern Recognition, Joint IAPR International Workshop, SSPR&SPR 2010, Cesme, Izmir, Turkey, August 18-20, 2010. Proceedings , 2010, SSPR/SPR.

[14]  Robert P. W. Duin,et al.  The Dissimilarity Representation as a Tool for Three-Way Data Classification: A 2D Measure , 2010, SSPR/SPR.

[15]  R. Duin,et al.  Dissimilarity representation on functional spectral data for classification , 2011 .

[16]  Luis Alvarez,et al.  Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications , 2012, Lecture Notes in Computer Science.