Overlapping spectra resolution using non-negative matrix factorization.

Non-negative matrix factorization (NMF), with the constraints of non-negativity, has been recently proposed for multi-variate data analysis. Because it allows only additive, not subtractive, combinations of the original data, NMF is capable of producing region or parts-based representation of objects. It has been used for image analysis and text processing. Unlike PCA, the resolutions of NMF are non-negative and can be easily interpreted and understood directly. Due to multiple solutions, the original algorithm of NMF [D.D. Lee, H.S. Seung, Nature 401 (1999) 788] is not suitable for resolving chemical mixed signals. In reality, NMF has never been applied to resolving chemical mixed signals. It must be modified according to the characteristics of the chemical signals, such as smoothness of spectra, unimodality of chromatograms, sparseness of mass spectra, etc. We have used the modified NMF algorithm to narrow the feasible solution region for resolving chemical signals, and found that it could produce reasonable and acceptable results for certain experimental errors, especially for overlapping chromatograms and sparse mass spectra. Simulated two-dimensional (2-D) data and real GUJINGGONG alcohol liquor GC-MS data have been resolved soundly by NMF technique. Butyl caproate and its isomeric compound (butyric acid, hexyl ester) have been identified from the overlapping spectra. The result of NMF is preferable to that of Heuristic evolving latent projections (HELP). It shows that NMF is a promising chemometric resolution method for complex samples.

[1]  H. Gampp,et al.  Calculation of equilibrium constants from multiwavelength spectroscopic data-III Model-free analysis of spectrophotometric and ESR titrations. , 1985, Talanta.

[2]  R. Synovec,et al.  Increasing the number of analyzable peaks in comprehensive two-dimensional separations through chemometrics. , 2001, Analytical chemistry.

[3]  Alexander Kai-man Leung,et al.  Analysis of the Water Soluble Constituents of Cordyceps Sinensis With Heuristic Evolving Latent Projections , 2000 .

[4]  Paul J. Gemperline,et al.  A priori estimates of the elution profiles of the pure components in overlapped liquid chromatography peaks using target factor analysis , 1984, J. Chem. Inf. Comput. Sci..

[5]  Bernt Schiele,et al.  Introducing a weighted non-negative matrix factorization for image classification , 2003, Pattern Recognit. Lett..

[6]  H. Sebastian Seung,et al.  Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[7]  Bruce R. Kowalski,et al.  Generalized rank annihilation factor analysis , 1986 .

[8]  Yizeng Liang,et al.  Heuristic evolving latent projections: resolving two-way multicomponent data. 1. Selectivity, latent-projective graph, datascope, local rank, and unique resolution , 1992 .

[9]  P.M.J. Coenegracht,et al.  Comparison of several curve resolution methods for drug impurity profiling using high-performance liquid chromatography with diode array detection , 2003 .

[10]  J. R. Torres-Lapasió,et al.  Resolution of multicomponent peaks by orthogonal projection approach, positive matrix factorization and alternating least squares , 2000 .

[11]  Qingsong Xu,et al.  ON THE EQUIVALENCE OF WINDOW FACTOR ANALYSIS AND ORTHOGONAL PROJECTION RESOLUTION , 1999 .

[12]  F. Chau,et al.  Determination of volatile components in peptic powder by gas chromatography-mass spectrometry and chemometric resolution. , 2001, Journal of chromatography. A.

[13]  F. Gong,et al.  Gas chromatography-mass spectrometry and chemometric resolution applied to the determination of essential oils in Cortex cinnamomi. , 2001, Journal of chromatography. A.

[14]  Jordi Vitrià,et al.  Evaluation of distance metrics for recognition based on non-negative matrix factorization , 2003, Pattern Recognit. Lett..

[15]  Josef Kittler,et al.  Proceedings of the 4th International Conference on Pattern Recognition , 1988 .

[16]  M. Bogusz,et al.  Reversed-phase high-performance liquid chromatographic database of retention indices and UV spectra of toxicologically relevant substances and its interlaboratory use. , 1994, Journal of chromatography. A.

[17]  Smoothed window factor analysis , 1999 .