New Method for Spectral Data Classification: Two-Way Moving Window Principal Component Analysis

Two-way moving window principal component analysis (TMWPCA), which considers all possible variable regions by using variable and sample moving windows, is proposed as a new spectral data classification method. In TMWPCA, the similarity between model function and the index obtained by variable and sample moving windows is defined as “fitness”. For each variable region selected by a variable moving window, the fitness is obtained through the use of a model function. By maximizing the fitness, an optimal variable region can be searched. A remarkable advantage of TMWPCA is that it offers an optimal variable region for the classification. To demonstrate the potential of TMWPCA, it has been applied to the classification of visible–near-infrared (Vis-NIR) spectra of mastitic and healthy udder quarters of cows measured in a nondestructive manner. The misclassification rate of TMWPCA has been compared with those of other chemometric methods, such as principal component analysis (PCA), soft independent modeling of class analogies (SIMCA), and principal discriminant variate (PDV). TMWPCA has yielded the lowest misclassification rate. The result indicates that TMWPCA is a powerful tool for the classification of spectral data.

[1]  P. Gemperline,et al.  Raw materials testing using soft independent modeling of class analogy analysis of near-infrared reflectance spectra , 1989 .

[2]  R Tsenkova,et al.  Near infrared spectroscopy for biomonitoring: cow milk composition measurement in a spectral region from 1,100 to 2,400 nanometers. , 2000, Journal of animal science.

[3]  T Fearn,et al.  Near-infrared spectroscopy for dairy management: measurement of unhomogenized milk composition. , 1999, Journal of dairy science.

[4]  Lihua Shen,et al.  Flow injection chemiluminescence determination of epinephrine using epinephrine-imprinted polymer as recognition material , 2003 .

[5]  R. Todeschini k-nearest neighbour method: The influence of data transformations and metrics , 1989 .

[6]  H. Birks Multivariate analysis in geology and geochemistry: An introduction , 1987 .

[7]  Manabu Kano,et al.  Comparison of statistical process monitoring methods: application to the Eastman challenge problem , 2000 .

[8]  P. Hopke,et al.  Classification of Single Particles Analyzed by ATOFMS Using an Artificial Neural Network, ART-2A , 1999 .

[9]  Manabu Kano,et al.  A new multivariate statistical process monitoring method using principal component analysis , 2001 .

[10]  M. Mellinger Multivariate data analysis: Its methods☆ , 1987 .

[11]  Theofanis Sapatinas,et al.  Discriminant Analysis and Statistical Pattern Recognition , 2005 .

[12]  Ru-Qin Yu,et al.  Non‐linear discriminant feature extraction using generalized back‐propagation network , 1996 .

[13]  Tormod Næs,et al.  A user-friendly guide to multivariate calibration and classification , 2002 .

[14]  Jian-hui Jiang,et al.  Principal Discriminant Variate Method for Classification of Multicollinear Data: Applications to Near-Infrared Spectra of Cow Blood Samples , 2002 .

[15]  Manabu Kano,et al.  Comparison of multivariate statistical process monitoring methods with applications to the Eastman challenge problem , 2002 .

[16]  S. Wold,et al.  Wavelength interval selection in multicomponent spectral analysis by moving window partial least-squares regression with applications to mid-infrared and near-infrared spectroscopic data. , 2002, Analytical chemistry.

[17]  Paul Geladi,et al.  Principal Component Analysis , 1987, Comprehensive Chemometrics.

[18]  Thomas B. Blank,et al.  Adaptive, global, extended Kalman filters for training feedforward neural networks , 1994 .

[19]  R. Cerri,et al.  Effect of timing of first clinical mastitis occurrence on lactational and reproductive performance of Holstein dairy cows. , 2004, Animal reproduction science.

[20]  Geoffrey J. McLachlan,et al.  Discriminant Analysis and Statistical Pattern Recognition: McLachlan/Discriminant Analysis & Pattern Recog , 2005 .

[21]  Svante Wold,et al.  Pattern recognition by means of disjoint principal components models , 1976, Pattern Recognit..

[22]  Wojtek J. Krzanowski,et al.  Orthogonal canonical variates for discrimination and classification , 1995 .

[23]  Olav M. Kvalheim,et al.  SIMCA multivariate data analysis of blue mussel components in environmental pollution studies , 1983 .

[24]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .