Critical comparison of methods predicting the number of components in spectroscopic data

Determining the number of chemical components in a mixture is a first important step to further analysis in spectroscopy. The accuracy of 13 statistical indices for estimation of the number of components that contribute to spectra was critically tested on simulated and on experimental data sets using algorithm INDICES in S-Plus. All methods are classified into two categories, precise methods based upon a knowledge of the instrumental error of the absorbance data, sinst (A), and approximate methods requiring no such knowledge. Most indices always predict the correct number of components even a presence of the minor one when the signal-to-error ratio (SER) is higher than 10 but in case of RESO and IND higher than 6. On base of SER the detection limit of every index method is estimated. Two indices, RESO and IND, correctly predict a minor component in a mixture even if its relative concentration is 0.5‐1% and solve an ill-defined problem with severe collinearity. For more than four components in a mixture the modifications of Elbergali et al. represent a useful resolution tool of a correct number of components in spectra for all indices. The Wernimont‐Kankare procedure performs reliable determination of the instrumental standard deviation of spectrophotometer used. In case of real experimental data the RESO, IND and indices methods based on knowledge of instrumental error should be preferred. © 2000 Elsevier Science B.V. All rights reserved.

[1]  Stephen G. Walburn,et al.  Collection and analysis of hazardous organic emissions , 1982 .

[2]  M. Meloun,et al.  Computation of Solution Equilibria , 1988 .

[3]  R. Brereton,et al.  Influence of predicted elution regions on the performance of methods for evolutionary factor analysis as applied to high-performance liquid chromatography , 1995 .

[4]  R. Cattell The Scree Test For The Number Of Factors. , 1966, Multivariate behavioral research.

[5]  Mikael Kubista,et al.  An automated procedure to predict the number of components in spectroscopic data , 1999 .

[6]  T. J. Klingen,et al.  Correlation of retention volumes of substitutued carboranes with molecular properties in high pressure liquid chromatography using factor analysis , 1974 .

[7]  Yang Li,et al.  Determination of the number of components in mixtures using a new approach incorporating chemical information , 1999 .

[8]  Edmund R. Malinowski,et al.  Abstract factor analysis of data with multiple sources of error and a modified Faber–Kowalski f‐test † , 1999 .

[9]  D. Massart Chemometrics: A Textbook , 1988 .

[10]  R. Brereton,et al.  Resolution of strongly overlapping two‐way multicomponent data by means of heuristic evolving latent projections , 1993 .

[11]  Smoothed window factor analysis , 1999 .

[12]  Edmund R. Malinowski,et al.  Factor Analysis in Chemistry , 1980 .

[13]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[14]  D. Cox,et al.  Analysis of Survival Data. , 1986 .

[15]  John M. Deane,et al.  Testing for redundancy in product quality control test criteria: An application to aviation turbine fuel , 1989 .

[16]  A. A. El-Awady,et al.  Determination of the number of species present in a system. New matrix rank treatment of spectrophotometric data , 1971 .

[17]  J. Kankare Computation of equilibrium constants for multicomponent systems from spectrophotometric data , 1970 .

[18]  Edmund R. Malinowski,et al.  Determination of the number of factors and the experimental error in a data matrix , 1977 .