Rank determination of spectroscopic profiles by means of cross validation: The effect of replicate measurements on the effective degrees of freedom

Abstract In the present work the effect of sample replication and ‘true’ instrumental resolution on the number of significant latent variables obtained by cross validation is investigated. Three sample sets containing three and four chemical components were prepared according to mixture design. The samples were analysed by Fourier transform infrared spectroscopy, and cross validation was performed using the predicted residual error sum of squares (PRESS) algorithm with a varying number of degrees of freedom. It was found that the correct number of chemical components could be estimated precisely from the instrumental data (the X matrix) presuming that the degrees of freedom were corrected for the above-mentioned effects. Grung, B. and Kvalheim, O.M., 1994. Rank determination of spectroscopic profiles by means of cross validation. The effect of replicate measurements on the effective degrees of freedom. Chemometrics and Intelligent Laboratory Systems , 22: 115–125.

[1]  Svante Wold,et al.  Comments on a recent evaluation of the SIMCA method , 1987 .

[2]  Edmund R. Malinowski,et al.  Statistical F‐tests for abstract factor analysis and target testing , 1989 .

[3]  J. van der Greef,et al.  An evaluation of SIMCA. Part 2 — classification of pyrolysis mass spectra of pseudomonas and serratia bacteria by pattern recognition using the SIMCA classifier , 1987 .

[4]  Gerrit Kateman,et al.  Realistic simulations of high-performance liquid chromatographic—ultraviolet data for the evaluation of multivariate techniques , 1991 .

[5]  Olav M. Kvalheim,et al.  A general-purpose program for multivariate data analysis , 1987 .

[6]  M. Stone Cross‐Validatory Choice and Assessment of Statistical Predictions , 1976 .

[7]  H. R. Keller,et al.  Heuristic evolving latent projections: resolving two-way multicomponent data. 2. Detection and resolution of minor constituents , 1992 .

[8]  Yizeng Liang,et al.  Heuristic evolving latent projections: resolving two-way multicomponent data. 1. Selectivity, latent-projective graph, datascope, local rank, and unique resolution , 1992 .

[9]  A. Lorber Error propagation and figures of merit for quantification by solving matrix equations , 1986 .

[10]  Terje V. Karstang,et al.  Analysis of Nontransparent Polymers: Mixture Design, Second-Derivative Attenuated Total Internal Reflectance FT-IR, and Multivariate Calibration , 1992 .

[11]  W. Full,et al.  Optimal Configuration and Information Content of Sets of Frequency Distributions , 1984 .

[12]  Svante Wold,et al.  Pattern recognition by means of disjoint principal components models , 1976, Pattern Recognit..

[13]  F. Mosteller,et al.  Inference in an Authorship Problem , 1963 .

[14]  S. Wold Cross-Validatory Estimation of the Number of Components in Factor and Principal Components Models , 1978 .

[15]  Theory of error applied to pure test vectors in target factor analysis , 1981 .

[16]  H. Van 'T Klooster,et al.  An evaluation of SIMCA. Part 1 — the reliability of the SIMCA pattern recognition method for a varying number of objects and features , 1987 .