Analysis of Deoxyribonucleotides with Principal Component and Partial Least-Squares Regression of UV Spectra after Fourier Preprocessing

Principal component (PCR) and partial least-squares (PLS) regression with Fourier preprocessing of ultraviolet spectra have been evaluated for the analysis of mixtures of the four major nucleotides of DNA. Results have been compared with those obtained without Fourier preprocessing and with those obtained with a straightforward P-matrix approach with optimum wavelength selection. Calibrations were developed and tested with the use of sixteen standard and fourteen unknown mixtures. The standards and nine of the unknowns followed the typical base equivalences found in DNA. For these nine unknowns, the best average percent error in the Fourier domain was 0.61%, as compared to 0.65% in the spectral domain. The remaining five unknowns, in which the nucleotide concentrations were independent of each other, provided a particularly difficult test of each method's ability to distinguish the components. For these unknowns, the best errors in the Fourier and spectral domains were 1.20 and 1.85%, respectively. For assessment of the effect of protein, the analysis was repeated with varying amounts of albumin up to 36% by weight added to the mixtures. All of the full spectral methods (PCR and PLS in the spectral and Fourier domains) were able to compensate for the spectral interference of the added protein and produced results as good as those without the protein. For nucleotide mixtures, ultraviolet spectroscopy represents a rapid, economical, sensitive, and nondestructive alternative to existing techniques.

[1]  Tormod Næs,et al.  Comparison of prediction methods for multicollinear data , 1985 .

[2]  J. Koenig,et al.  Factor Analysis Applied to Fourier Transform Infrared Spectra , 1979 .

[3]  Chris W. Brown,et al.  Multicomponent Analysis Using Fourier Transform Infrared and UV Spectra , 1988 .

[4]  E. G. Richards,et al.  A simplified method for the determination of the nucleotide composition of polyribonucleotides by spectrophotometric analysis. , 1968, European journal of biochemistry.

[5]  P. R. Osborn,et al.  Materials Characterization Using Factor Analysis of FT-IR Spectra. Part 2: Mathematical and Statistical Considerations , 1985 .

[6]  J. Koenig,et al.  Factor Analysis and Fourier Transform Infrared Spectra of Compatible Blends of Polyphenylene Oxides and Polystyrene , 1981 .

[7]  J. Besson,et al.  Optimization of the determination of RNA nucleotide composition by ultraviolet spectral analysis. , 1986, Journal of biochemical and biophysical methods.

[8]  L. E. Wangen,et al.  A theoretical foundation for the PLS algorithm , 1987 .

[9]  Wolfhard Wegscheider,et al.  Spectrophotometric multicomponent analysis applied to trace metal determinations , 1985 .

[10]  T. Hirschfeld,et al.  Near-Infrared Reflectance Analysis by Gauss-Jordan Linear Algebra , 1983 .

[11]  Chris W. Brown,et al.  Accounting for Impurities in Spectroscopic Multicomponent Analysis Using Fourier Vectors , 1986 .

[12]  B. Kowalski,et al.  Partial least-squares regression: a tutorial , 1986 .

[13]  A. W. Pratt,et al.  Computer-assisted characterization of oligoribonucleotides by their ultraviolet absorption. , 1968, Analytical biochemistry.

[14]  Chris W. Brown,et al.  Matrix representations and criteria for selecting analytical wavelengths for multicomponent spectroscopic analysis , 1982 .

[15]  Svante Wold,et al.  Partial least-squares method for spectrofluorimetric analysis of mixtures of humic acid and lignin sulfonate , 1983 .

[16]  Chris W. Brown,et al.  Infrared Quantitative Analysis in the Fourier Domain: Processing Vector Representations , 1986 .