Models and Methods for Quantitative Analysis of Surface-Enhanced Raman Spectra

The quantitative analysis of surface-enhanced Raman spectra using scattering nanoparticles has shown the potential and promising applications in in vivo molecular imaging. The diverse approaches have been used for quantitative analysis of Raman spectra information, which can be categorized as direct classical least squares models, full spectrum multivariate calibration models, selected multivariate calibration models, and latent variable regression (LVR) models. However, the working principle of these methods in the Raman spectra application remains poorly understood and a clear picture of the overall performance of each model is missing. Based on the characteristics of the Raman spectra, in this paper, we first provide the theoretical foundation of the aforementioned commonly used models and show why the LVR models are more suitable for quantitative analysis of the Raman spectra. Then, we demonstrate the fundamental connections and differences between different LVR methods, such as principal component regression, reduced-rank regression, partial least square regression (PLSR), canonical correlation regression, and robust canonical analysis, by comparing their objective functions and constraints. We further prove that PLSR is literally a blend of multivariate calibration and feature extraction model that relates concentrations of nanotags to spectrum intensity. These features (a.k.a. latent variables) satisfy two purposes: the best representation of the predictor matrix and correlation with the response matrix. These illustrations give a new understanding of the traditional PLSR and explain why PLSR exceeds other methods in quantitative analysis of the Raman spectra problem. In the end, all the methods are tested on the Raman spectra datasets with different evaluation criteria to evaluate their performance.

[1]  D. Jacobs,et al.  Bypassing synthesis: PLS for face recognition with pose, low-resolution and sketch , 2011, CVPR 2011.

[2]  J. Kettenring,et al.  Canonical Analysis of Several Sets of Variables , 2022 .

[3]  S. Wold,et al.  PLS-regression: a basic tool of chemometrics , 2001 .

[4]  M. Tso Reduced‐Rank Regression and Canonical Analysis , 1981 .

[5]  Herman Wold,et al.  Soft modelling: The Basic Design and Some Extensions , 1982 .

[6]  Stan Lipovetsky,et al.  Modelling and forecasting with robust canonical analysis: method and application , 2000, Comput. Oper. Res..

[7]  S. D. Jong SIMPLS: an alternative approach to partial least squares regression , 1993 .

[8]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[9]  John R. Lombardi,et al.  A Unified Approach to Surface-Enhanced Raman Spectroscopy , 2008 .

[10]  Karl J. Friston,et al.  Characterizing the Response of PET and fMRI Data Using Multivariate Linear Models , 1997, NeuroImage.

[11]  S. Gambhir,et al.  Noninvasive molecular imaging of small living subjects using Raman spectroscopy , 2008, Proceedings of the National Academy of Sciences.

[12]  Kaare Brandt Petersen,et al.  Sparse Kernel Orthonormalized PLS for feature extraction in large data sets , 2006, NIPS.

[13]  Lei Zhang,et al.  Quantitative analysis of thymine with surface-enhanced Raman spectroscopy and partial least squares (PLS) regression , 2010, Analytical and bioanalytical chemistry.

[14]  H. Hotelling Analysis of a complex of statistical variables into principal components. , 1933 .

[15]  Sijmen de Jong,et al.  The objective function of partial least squares regression , 1998 .

[16]  Gene H. Golub,et al.  Matrix computations (3rd ed.) , 1996 .

[17]  Royston Goodacre,et al.  Quantitative Analysis of the Banned Food Dye Sudan-1 Using Surface Enhanced Raman Scattering with Multivariate Chemometrics† , 2010 .

[18]  Bong-Hyun Jun,et al.  Nanoparticle probes with surface enhanced Raman spectroscopic tags for cellular cancer targeting. , 2006, Analytical chemistry.

[19]  A. E. Hoerl,et al.  Ridge Regression: Applications to Nonorthogonal Problems , 1970 .

[20]  R. Bro Multivariate calibration: What is in chemometrics for the analytical chemist? , 2003 .

[21]  Patrick A Johnson,et al.  Surface-enhanced Raman scattering detection of DNA derived from the west nile virus genome using magnetic capture of Raman-active gold nanoparticles. , 2011, Analytical chemistry.

[22]  A. Höskuldsson PLS regression methods , 1988 .

[23]  Roman Rosipal,et al.  Overview and Recent Advances in Partial Least Squares , 2005, SLSFS.

[24]  Duncan Graham,et al.  Quantitative SERRS for DNA sequence analysis. , 2008, Chemical Society reviews.

[25]  Barbara Rasco,et al.  Application of surface enhanced Raman spectroscopy for analyses of restricted sulfa drugs , 2011 .

[26]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[27]  Stan Lipovetsky,et al.  Identifying critical success factors in defense development projects: A multivariate analysis , 1996 .

[28]  Valeriu Vrabie,et al.  Preprocessing Methods of Raman Spectra for Source Extraction on Biomedical Samples: Application on Paraffin-Embedded Skin Biopsies , 2009, IEEE Transactions on Biomedical Engineering.

[29]  Larry S. Davis,et al.  Vehicle Detection Using Partial Least Squares , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[30]  Jacob A. Wegelin,et al.  A Survey of Partial Least Squares (PLS) Methods, with Emphasis on the Two-Block Case , 2000 .

[31]  Carl A Batt,et al.  Detection of carbendazim by surface-enhanced Raman scattering using cyclodextrin inclusion complexes on gold nanorods. , 2009, Analytical chemistry.

[32]  S. Bell,et al.  Quantitative surface-enhanced Raman spectroscopy. , 2008, Chemical Society reviews.

[33]  M. Pelletier,et al.  Quantitative Analysis Using Raman Spectrometry , 2003, Applied spectroscopy.

[34]  J. Mo,et al.  Baseline correction by improved iterative polynomial fitting with automatic threshold , 2006 .

[35]  Yu-Tao Xie Surface-enhanced hyper raman and surface-enhanced raman scattering : novel substrates, surface probing molecules and chemical applications , 2007 .

[36]  Jieping Ye,et al.  Finite Domain Constraint Solver Learning , 2009, IJCAI.

[37]  John Shawe-Taylor,et al.  Canonical Correlation Analysis: An Overview with Application to Learning Methods , 2004, Neural Computation.

[38]  John Shawe-Taylor,et al.  Efficient Sparse Kernel Feature Extraction Based on Partial Least Squares , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[39]  Rolf Sundberg Small sample and selection bias effects in calibration under latent factor regression models , 2007 .

[40]  H. Wold Path Models with Latent Variables: The NIPALS Approach , 1975 .

[41]  Sanjiv S. Gambhir,et al.  Multiplexed imaging of surface enhanced Raman scattering nanotags in living mice using noninvasive Raman spectroscopy , 2009, Proceedings of the National Academy of Sciences.

[42]  Duncan Graham,et al.  Quantitative enhanced Raman scattering of labeled DNA from gold and silver nanoparticles. , 2007, Small.

[43]  Richard G. Brereton,et al.  Introduction to multivariate calibration in analytical chemistry , 2000 .

[44]  Jean X. Gao,et al.  Probabilistic Partial Least Square Regression: A Robust Model for Quantitative Analysis of Raman Spectroscopy Data , 2011, 2011 IEEE International Conference on Bioinformatics and Biomedicine.

[45]  Detlef Schiel,et al.  Surface-enhanced Raman scattering based approach for quantitative determination of creatinine in human serum. , 2005, Analytical chemistry.

[46]  S. Sonja Sekulic,et al.  Multivariate data analysis for Raman imaging of a model pharmaceutical tablet , 2005 .

[47]  Alison J. Burnham,et al.  Frameworks for latent variable multivariate regression , 1996 .

[48]  Robert H. Halstead,et al.  Matrix Computations , 2011, Encyclopedia of Parallel Computing.

[49]  John Paul Pezacki,et al.  Development of nanoparticle probes for multiplex SERS imaging of cell surface proteins. , 2010, Nanoscale.