A Novel Multi-model Data-Driven Ensemble Technique for the Prediction of Retention Factor in HPLC Method Development

Reliable simulation of retention factor (k) is crucial in high-performance liquid chromatography (HPLC) method development. In this research, three different Artificial intelligence (AI) based models, namely the multi-layer perceptron (MLP), Support vector machine (SVM) and Hammerstein–Weiner (HW) models, were employed as well as three ensemble techniques, i.e., neural network ensemble (NNE), weighted average ensemble (WAE) and simple average ensemble (SAE) to predict k for HPLC method development. In this context, the pH and composition of the mobile phase (methanol) are used as the input variables with the corresponding Methyclothiazide (M) and Amiloride (A) as antihypertensive target analytes. The performance efficiency of the models was evaluated using mean square error (MSE), determination coefficient (R2), and correlation coefficient (R). The results obtained from the single models showed that MLP outperformed the other two models and increased the prediction accuracy up to 1% and 3% for the HW and SVM models, respectively, for the prediction of M. However, for the prediction of A, SVM outperformed the other two models and increased the prediction accuracy up to 7% and 6% for HW and MLP, respectively. In the ensemble technique, the results obtained for the prediction of both M and A demonstrated that NNE increased the performance accuracy by 14% of the single models. Also, NNE proved to be superior to the two linear ensembles and improved the prediction accuracy up to 14% and 2% for SAE and WAE, respectively, for the simulation of M with R2 = 0.9962 and 0.9949 for both calibration and verification, and up to 9% and 6% for A with R2 = 0.9606 and 0.9569 for both calibration and verification phases respectively. The overall results depicted the reliability and robustness of both the AI-based models and justified the enhancement capability for ensemble techniques for both the two analytes.

[1]  Bahram Choubin,et al.  Multiple linear regression, multi-layer perceptron network and adaptive neuro-fuzzy inference system for forecasting precipitation based on large-scale climate signals , 2016 .

[2]  Mira Zečević,et al.  Application of neural networks for response surface modeling in HPLC optimization , 1998 .

[3]  Vahid Nourani,et al.  Wastewater treatment plant performance analysis using artificial intelligence - an ensemble approach. , 2018, Water science and technology : a journal of the International Association on Water Pollution Research.

[4]  G. Esposito,et al.  Machine Learning Algorithms for the Forecasting of Wastewater Quality Indicators , 2017 .

[5]  Zaher Mundher Yaseen,et al.  A Novel Hybrid Evolutionary Data-Intelligence Algorithm for Irrigation and Power Production Management: Application to Multi-Purpose Reservoir Systems , 2019, Sustainability.

[6]  Maria G. Kouskoura,et al.  Modeling the drugs' passive transfer in the body based on their chromatographic behavior. , 2014, Journal of pharmaceutical and biomedical analysis.

[7]  Ukashatu Abubakar,et al.  Modelling of Uncertain System: A comparison study of Linear and Non-Linear Approaches , 2019, 2019 IEEE International Conference on Automatic Control and Intelligent Systems (I2CACIS).

[8]  K. Nikolić,et al.  Chemometrically assisted RP-HPLC method development for efficient separation of ivabradine and its eleven impurities , 2020, Acta Chromatographica.

[9]  S. I. Abba,et al.  Simulation for response surface in the HPLC optimization method development using artificial intelligence models: A data-driven approach , 2020 .

[10]  Fen Guo,et al.  A new identification method for Wiener and Hammerstein Systems , 2004 .

[11]  Oliver Kohlbacher,et al.  Statistical learning of peptide retention behavior in chromatographic separations: a new kernel-based approach for computational proteomics , 2007, BMC Bioinformatics.

[12]  Abazar Solgi,et al.  Improving SVR and ANFIS performance using wavelet transform and PCA algorithm for modeling and predicting biochemical oxygen demand (BOD) , 2017 .

[13]  A. A. D’Archivio,et al.  Cross-column prediction of gas-chromatographic retention of polybrominated diphenyl ethers. , 2013, Journal of chromatography. A.

[14]  Muhammad Sani Gaya,et al.  Estimation of Turbidity in Water Treatment Plant using Hammerstein-Wiener and Neural Network Technique , 2017 .

[15]  Jazuli Abdullahi,et al.  River water modelling prediction using multi-linear regression, artificial neural network, and adaptive neuro-fuzzy inference system techniques , 2017 .

[16]  Haidy A. Gad,et al.  Authentication and discrimination of green tea samples using UV–vis, FTIR and HPLC techniques coupled with chemometrics analysis , 2019, Journal of pharmaceutical and biomedical analysis.

[17]  Qinqin Chai,et al.  Geographical Origin Identification for Tetrastigma Hemsleyanum Based on High Performance Liquid Chromatographic Fingerprint , 2019, 2019 Chinese Automation Congress (CAC).

[18]  N. Null Artificial Neural Networks in Hydrology. I: Preliminary Concepts , 2000 .

[19]  John W. Dolan,et al.  High-Performance Gradient Elution: The Practical Application of the Linear-Solvent-Strength Model , 2006 .

[20]  S. Agatonovic-Kustrin,et al.  Use of ANN modelling in structure--retention relationships of diuretics in RP-HPLC. , 1999, Journal of pharmaceutical and biomedical analysis.

[21]  A. El-Brashy,et al.  Optimization and modeling of a green dual detected RP-HPLC method by UV and fluorescence detectors using two level full factorial design for simultaneous determination of sofosbuvir and ledipasvir: Application to average content and uniformity of dosage unit testing , 2019, Microchemical Journal.

[22]  Nura Muhammad Baba,et al.  CURRENT ISSUES IN ENSEMBLE METHODS AND ITS APPLICATIONS , 2015 .

[23]  Vahid Nourani,et al.  Multi-parametric modeling of water treatment plant using AI-based non-linear ensemble , 2019, Journal of Water Supply: Research and Technology-Aqua.

[24]  Y. Marrero-Ponce,et al.  QSRR prediction of gas chromatography retention indices of essential oil components , 2017, Chemical Papers.

[25]  Eyyüp Gülbandilar,et al.  Prediction of the effects of fly ash and silica fume on the setting time of Portland cement with fuzzy logic , 2012, Neural Computing and Applications.

[26]  Vladimir Naumovich Vapni The Nature of Statistical Learning Theory , 1995 .

[27]  Farshad Rahimpour,et al.  Retention time modeling of short-chain aliphatic acids in aqueous ion-exclusion chromatography systems under several conditions using computational intelligence methods (artificial neural network and adaptive neuro-fuzzy inference system) , 2018, Journal of Liquid Chromatography & Related Technologies.

[28]  Manhua Liu,et al.  SVM Based Classification and Prediction System for Gastric Cancer Using Dominant Features of Saliva , 2019 .

[29]  Antonije Onjia,et al.  Optimization of artificial neural network for retention modeling in high-performance liquid chromatography. , 2004, Talanta.

[30]  V. Hatzimanikatis,et al.  Antihypertensive Drugs Metabolism: An Update to Pharmacokinetic Profiles and Computational Approaches , 2014, Current pharmaceutical design.

[31]  Vahid Nourani,et al.  Multi-step ahead modelling of river water quality parameters using ensemble artificial intelligence-based approach , 2019, Journal of Hydrology.

[32]  P. Richter,et al.  Simultaneous determination of amiloride and furosemide in pharmaceutical formulations by first digital derivative spectrophotometry. , 2002, International journal of pharmaceutics.

[33]  Anurag Malik,et al.  Potential of Hybrid Data-Intelligence Algorithms for Multi-Station Modelling of Rainfall , 2019, Water Resources Management.

[34]  Vijay P. Singh,et al.  Modeling daily soil temperature using data-driven models and spatial distribution , 2014, Theoretical and Applied Climatology.

[35]  Effat Dehghanian,et al.  Comparison of single best artificial neural network and neural network ensemble in modeling of palladium microextraction , 2015, Monatshefte für Chemie - Chemical Monthly.

[36]  Han-seung Lee,et al.  Improvement of Early Strength of Cement Mortar Containing Granulated Blast Furnace Slag Using Industrial Byproducts , 2017, Materials.

[37]  S. Agatonovic-Kustrin,et al.  Application of artificial neural networks in HPLC method development. , 1998, Journal of pharmaceutical and biomedical analysis.

[38]  Jazuli Abdullahi,et al.  Artificial intelligence-based approaches for multi-station modelling of dissolve oxygen in river , 2018 .

[39]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[40]  W. D. dos Santos,et al.  Screening of Mangifera indica L. functional content using PCA and neural networks (ANN). , 2018, Food chemistry.

[41]  Nguyen Thi Thuy Linh,et al.  Emerging evolutionary algorithm integrated with kernel principal component analysis for modeling the performance of a water treatment plant , 2020 .

[42]  Josiah Poon,et al.  Ensemble learning for prediction of the bioactivity capacity of herbal medicines from chromatographic fingerprints , 2015, BMC Bioinformatics.

[43]  C. Patro,et al.  An overview of experimental designs in HPLC method development and validation , 2018, Journal of pharmaceutical and biomedical analysis.

[44]  A. A. D’Archivio,et al.  Artificial neural network prediction of multilinear gradient retention in reversed-phase HPLC: comprehensive QSRR-based models combining categorical or structural solute descriptors and gradient profile parameters , 2015, Analytical and Bioanalytical Chemistry.

[45]  Determination of phenolic composition of oilseed whole flours by HPLC-DAD with evaluation using chemometric analyses , 2020, Microchemical Journal.

[46]  Georg Bretthauer,et al.  Identification of MISO Wiener and Hammerstein systems , 2003, 2003 European Control Conference (ECC).

[47]  A. Weerts,et al.  Ensemble data assimilation methods for improving river water quality forecasting accuracy. , 2019, Water research.

[48]  Farzana Anowar,et al.  Multi-class Ensemble Learning of Imbalanced Bidding Fraud Data , 2019, Canadian Conference on AI.

[49]  H. M. Maher,et al.  Application of artificial neural networks for response surface modelling in HPLC method development , 2012 .

[50]  Kathleen Marchal,et al.  Validating module network learning algorithms using simulated data , 2007, BMC Bioinformatics.

[51]  P. Haddad,et al.  Retention prediction of low molecular weight anions in ion chromatography based on quantitative structure-retention relationships applied to the linear solvent strength model. , 2017, Journal of chromatography. A.

[52]  M. Omid,et al.  Egg Quality Prediction Using Dielectric and Visual Properties Based on Artificial Neural Network , 2015, Food Analytical Methods.

[53]  Leon P Barron,et al.  Artificial neural network modelling of pharmaceutical residue retention times in wastewater extracts using gradient liquid chromatography-high resolution mass spectrometry data. , 2015, Journal of chromatography. A.

[54]  Hui Li,et al.  A new application of WT-ANN method to control the preparation process of metformin hydrochloride tablets by near infrared spectroscopy compared to PLS. , 2013, Journal of pharmaceutical and biomedical analysis.