Pharmaceutical Analysis Model Robustness From Bagging-PLS and PLS Using Systematic Tracking Mapping

Our work proved that processing trajectory could effectively obtain a more reliable and robust quantitative model compared with the step-by-step optimization method. The use of systematic tracking was investigated as a tool to optimize modeling parameters including calibration method, spectral pretreatment and variable selection latent factors. The variable was selected by interval partial least-squares (iPLS), backward interval partial least-square (BiPLS) and synergy interval partial least-squares (SiPLS). The models were established by Partial least squares (PLS) and Bagging-PLS. The model performance was assessed by using the root mean square errors of validation (RMSEP) and the ratio of standard error of prediction to standard deviation (RPD). The proposed procedure was used to develop the models for near infrared (NIR) datasets of active pharmaceutical ingredients in tablets and chlorogenic acid of Lonicera japonica solution in ethanol precipitation process. The results demonstrated the processing trajectory has great advantages and feasibility in the development and optimization of multivariate calibration models as well as the effectiveness of bagging model and variable selection to improve prediction accuracy and robustness.

[1]  N. K. Faber,et al.  Multivariate sensitivity for the interpretation of the effect of spectral pretreatment methods on near-infrared calibration model predictions. , 1999, Analytical chemistry.

[2]  Luqi Huang,et al.  NIR Rapid Assessments of Blumea balsamifera (Ai-na-xiang) in China , 2017, Molecules.

[3]  Riccardo Leardi,et al.  Application of genetic algorithms for pixel selection in multivariate image analysis for a QSAR study of trypanocidal activity for quinone compounds and design new quinone compounds , 2014 .

[4]  B. Mahanty,et al.  Spectroscopic quantitation of tetrazolium formazan in nano-toxicity assay with interval-based partial least squares regression and genetic algorithm , 2016 .

[5]  Quansheng Chen,et al.  Determination of total polyphenols content in green tea using FT-NIR spectroscopy and different PLS algorithms. , 2008, Journal of pharmaceutical and biomedical analysis.

[6]  S. Engelsen,et al.  Interval Partial Least-Squares Regression (iPLS): A Comparative Chemometric Study with an Example from Near-Infrared Spectroscopy , 2000 .

[7]  A. Garrido-Varo,et al.  Optimisation of the spectral pre-treatments used for Iberian pig fat NIR calibrations , 2007 .

[8]  K. Kachrimanis,et al.  Quantitative analysis of paracetamol polymorphs in powder mixtures by FT-Raman spectroscopy and PLS regression. , 2007, Journal of pharmaceutical and biomedical analysis.

[9]  Xinyuan Shi,et al.  Development and validation of NIR model using low-concentration calibration range: rapid analysis of Lonicera japonica solution in ethanol precipitation process , 2012 .

[10]  R. Leardi,et al.  Sequential application of backward interval partial least squares and genetic algorithms for the selection of relevant spectral regions , 2004 .

[11]  Rasmus Bro,et al.  Exploring the phenotypic expression of a regulatory proteome-altering gene by spectroscopy and chemometrics , 2001 .

[12]  M. Dyrby,et al.  Chemometric Quantitation of the Active Substance (Containing C≡N) in a Pharmaceutical Tablet Using Near-Infrared (NIR) Transmittance and NIR FT-Raman Spectra , 2002 .

[13]  Xiaping Fu,et al.  Detection of melamine in milk powders using near-infrared hyperspectral imaging combined with regression coefficient of partial least square regression model. , 2016, Talanta.

[14]  Qun Ma,et al.  NIR spectroscopy as a process analytical technology (PAT) tool for monitoring and understanding of a hydrolysis process. , 2013, Bioresource technology.

[15]  B. Üstün,et al.  Quantification of chondroitin sulfate and dermatan sulfate in danaparoid sodium by 1H NMR spectroscopy and PLS regression , 2011, Analytical and bioanalytical chemistry.

[16]  Qun Ma,et al.  A novel model selection strategy using total error concept. , 2013, Talanta.

[17]  M. Blanco,et al.  Determination of low analyte concentrations by near-infrared spectroscopy: effect of spectral pretreatments and estimation of multivariate detection limits. , 2007, Analytica chimica acta.

[18]  Qun Ma,et al.  Optimization of Parameter Selection for Partial Least Squares Model Development , 2015, Scientific Reports.

[19]  Pengcheng Nie,et al.  Hybrid variable selection in visible and near-infrared spectral analysis for non-invasive quality determination of grape juice. , 2010, Analytica chimica acta.

[20]  P. Williams The RPD Statistic: A Tutorial Note , 2014 .

[21]  Kim H. Esbensen,et al.  The RPD Myth… , 2014 .

[22]  Yang Li,et al.  A Online NIR Sensor for the Pilot-Scale Extraction Process in Fructus Aurantii Coupled with Single and Ensemble Methods , 2015, Sensors.

[23]  Marcelo Nascimento Martins,et al.  An application of subagging for the improvement of prediction accuracy of multivariate calibration models , 2006 .