Adaptive wavelet packet transform for support vector machine modeling as globally optimized by particle swarm optimization algorithm

Multivariate calibration algorithms and statistical methods are typically required in quantitative near infrared (NIR) spectroscopic methods to relate NIR spectral response to the chemical or physical properties of samples. Based on two different versions of particle swarm optimization (PSO) algorithms, an adaptively configured wavelet packet transform (WPT) based support vector machine (SVM) was developed and applied to quantitative near-infrared (NIR) spectral data analysis. WPT has been proved to be an effective method in extracting feature information from raw NIR spectra. It is also useful in noise suppression and data compression. SVM is famous for its function approximation ability and remarkable generalization performance. For SVM modeling on the basis of WPT, discrete PSO and continuous PSO were used synchronously to optimize the structure of a WPT tree and the parameters of SVM synergistically according to the performance of the total model. It, thus, enables an adaptive and parameter-free model construction technology for NIR spectral data analysis. The performance of the proposed Ad-WPT-SVM is investigated using two real datasets. The results of different methods are compared, indicating that the proposed method holds great potential for robust and reliable quantitative NIR spectral data analysis.

[1]  Yuan Xi-Gang,et al.  An improved PSO algorithm for solving non-convex NLP/MINLP problems with equality constraints , 2007 .

[2]  C. Ruckebusch,et al.  Genetic algorithm optimisation combined with partial least squares regression and mutual information variable selection procedures in near-infrared quantitative analysis of cotton-viscose textiles. , 2007, Analytica chimica acta.

[3]  H. Adeli,et al.  Wavelet-based EEG processing for computer-aided seizure detection and epilepsy diagnosis , 2015, Seizure.

[4]  Åsmund Rinnan,et al.  Recursive weighted partial least squares (rPLS): an efficient variable selection method using PLS , 2014 .

[5]  Alex Mogilner,et al.  Mechanics of Motor Proteins and the Cytoskeleton , 2002 .

[6]  S. Wold,et al.  Wavelength interval selection in multicomponent spectral analysis by moving window partial least-squares regression with applications to mid-infrared and near-infrared spectroscopic data. , 2002, Analytical chemistry.

[7]  Ann-Christine Ehlis,et al.  Application of functional near-infrared spectroscopy in psychiatry , 2014, NeuroImage.

[8]  Guo-Li Shen,et al.  Modified particle swarm optimization algorithm for variable selection in MLR and PLS modeling: QSAR studies of antagonism of angiotensin II antagonists. , 2004, European journal of pharmaceutical sciences : official journal of the European Federation for Pharmaceutical Sciences.

[9]  D. Agrafiotis,et al.  Feature selection for structure-activity correlation using binary particle swarms. , 2002, Journal of medicinal chemistry.

[10]  S. Engelsen,et al.  Interval Partial Least-Squares Regression (iPLS): A Comparative Chemometric Study with an Example from Near-Infrared Spectroscopy , 2000 .

[11]  E. C. Ellis,et al.  Source resolution of the fine carbonaceous aerosol by principal component-stepwise regression analysis. , 1988, Environmental science & technology.

[12]  C. Gendrin,et al.  Pharmaceutical applications of vibrational chemical imaging and chemometrics: a review. , 2008, Journal of pharmaceutical and biomedical analysis.

[13]  Jian-hui Jiang,et al.  Spectral regions selection to improve prediction ability of PLS models by changeable size moving window partial least squares and searching combination moving window partial least squares , 2004 .

[14]  Masoud Mohseni,et al.  Quantum support vector machine for big feature and big data classification , 2013, Physical review letters.

[15]  S. Qin Recursive PLS algorithms for adaptive data modeling , 1998 .

[16]  D. Massart,et al.  Application of Wavelet Packet Transform in Pattern Recognition of Near-IR Data , 1996 .

[17]  J. Callis,et al.  Prediction of gasoline octane numbers from near-infrared spectral features in the range 660-1215 nm , 1989 .

[18]  S. Šašiċ,et al.  Detailed analysis of the online near-infrared spectra of pharmaceutical blend in a rotary tablet press feed frame. , 2015, Journal of pharmaceutical and biomedical analysis.

[19]  W. Cai,et al.  A variable selection method based on uninformative variable elimination for multivariate calibration of near-infrared spectra , 2008 .

[20]  Pedro Rios,et al.  Plastics disassembly versus bulk recycling: engineering design for end-of-life electronics resource recovery. , 2003, Environmental science & technology.

[21]  Li-Yeh Chuang,et al.  Improved binary PSO for feature selection using gene expression data , 2008, Comput. Biol. Chem..

[22]  R. Leardi,et al.  Sequential application of backward interval partial least squares and genetic algorithms for the selection of relevant spectral regions , 2004 .

[23]  S. Wold,et al.  PLS regression on wavelet compressed NIR spectra , 1998 .

[24]  D. Massart,et al.  Near-infrared spectroscopy applications in pharmaceutical analysis. , 2007, Talanta.

[25]  Fang Wang,et al.  A method for near-infrared spectral calibration of complex plant samples with wavelet transform and elimination of uninformative variables , 2004, Analytical and bioanalytical chemistry.

[26]  Di Wu,et al.  Application of near infrared spectroscopy for the rapid determination of antioxidant activity of bamboo leaf extract. , 2012, Food chemistry.

[27]  R. Yu,et al.  An ensemble of Monte Carlo uninformative variable elimination for wavelength selection. , 2008, Analytica chimica acta.