Development and application of retention time prediction models in the suspect and non-target screening of emerging contaminants.

Hydrophilic interaction liquid chromatography (HILIC) and reversed phase LC (RPLC) coupled to high resolution mass spectrometry (HRMS) are widely used for the identification of suspects and unknown compounds in the environment. For the identification of unknowns, apart from mass accuracy and isotopic fitting, retention time (tR) and MS/MS spectra evaluation is required. In this context, a novel comprehensive workflow was developed to study the tR behavior of large groups of emerging contaminants using Quantitative Structure-Retention Relationships (QSRR). 682 compounds were analyzed by HILIC-HRMS in positive Electrospray Ionization mode (ESI). Moreover, an extensive dataset was built for RPLC-HRMS including 1830 and 308 compounds for positive and negative ESI, respectively. Support Vector Machines (SVM) was used to model the tR data. The applicability domains of the models were studied by Monte Carlo Sampling (MCS) methods. The MCS method was also used to calculate the acceptable error windows for the predicted tR from various LC conditions. This paper provides validated models for predicting tR in HILIC/RPLC-HRMS platforms to facilitate identification of new emerging contaminants by suspect and non-target HRMS screening, and were applied for the identification of transformation products (TPs) of emerging contaminants and biocides in wastewater and sludge.

[1]  Janusz Pawliszyn,et al.  Quantitative structure-retention relationships models for prediction of high performance liquid chromatography retention time of small molecules: endogenous metabolites and banned compounds. , 2013, Analytica chimica acta.

[2]  I. Ferrer,et al.  Analysis of hydraulic fracturing additives by LC/Q-TOF-MS , 2015, Analytical and Bioanalytical Chemistry.

[3]  P. Carrupt,et al.  Retention time prediction for dereplication of natural products (CxHyOz) in LC-MS metabolite profiling. , 2014, Phytochemistry.

[4]  Thomas Letzel,et al.  Non-target screening with high-resolution mass spectrometry: critical review using a collaborative trial on water analysis , 2015, Analytical and Bioanalytical Chemistry.

[5]  Valeri I. Babushok,et al.  Retention Characteristics of Peptides in RP-LC: Peptide Retention Prediction , 2010 .

[6]  Eamonn F. Healy,et al.  Development and use of quantum mechanical molecular models. 76. AM1: a new general purpose quantum mechanical molecular model , 1985 .

[7]  Scott D. Kahn,et al.  Current Status of Methods for Defining the Applicability Domain of (Quantitative) Structure-Activity Relationships , 2005, Alternatives to laboratory animals : ATLA.

[8]  Oliver Kohlbacher,et al.  Retention Time Prediction Improves Identification in Nontargeted Lipidomics Approaches. , 2015, Analytical chemistry.

[9]  Leon P Barron,et al.  Artificial neural network modelling of pharmaceutical residue retention times in wastewater extracts using gradient liquid chromatography-high resolution mass spectrometry data. , 2015, Journal of chromatography. A.

[10]  Kamel Mansouri,et al.  A comparison of three liquid chromatography (LC) retention time prediction models. , 2018, Talanta.

[11]  Stellan Fischer,et al.  Suspect Screening and Regulatory Databases: A Powerful Combination To Identify Emerging Micropollutants. , 2018, Environmental science & technology.

[12]  Karl Fraser,et al.  Predicting retention time in hydrophilic interaction liquid chromatography mass spectrometry and its use for peak annotation in metabolomics , 2014, Metabolomics.

[13]  Andrea Armirotti,et al.  Kernel-Based, Partial Least Squares Quantitative Structure-Retention Relationship Model for UPLC Retention Time Prediction: A Useful Tool for Metabolite Identification. , 2016, Analytical chemistry.

[14]  Nikolaos S. Thomaidis,et al.  Targeted and non-targeted liquid chromatography-mass spectrometric workflows for identification of transformation products of emerging pollutants in the aquatic environment , 2015 .

[15]  A. Pelander,et al.  Prediction of liquid chromatographic retention for differentiation of structural isomers. , 2012, Analytica chimica acta.

[16]  R. Breitling,et al.  Toward global metabolomics analysis with hydrophilic interaction liquid chromatography-mass spectrometry: improved metabolite identification by retention time prediction. , 2011, Analytical chemistry.

[17]  Adrian D Hegeman,et al.  Easy and accurate high-performance liquid chromatography retention prediction with different gradients, flow rates, and instruments by back-calculation of gradient and flow rate profiles. , 2011, Journal of chromatography. A.

[18]  Reza Aalizadeh,et al.  Quantitative Structure-Retention Relationship Models To Support Nontarget High-Resolution Mass Spectrometric Screening of Emerging Contaminants in Environmental Samples , 2016, J. Chem. Inf. Model..

[19]  M. Loos,et al.  Quantitative target and systematic non-target analysis of polar organic micro-pollutants along the river Rhine using high-resolution mass-spectrometry--Identification of unknown sources and compounds. , 2015, Water research.

[20]  Reza Aalizadeh,et al.  Extended Suspect and Non-Target Strategies to Characterize Emerging Polar Organic Contaminants in Raw Wastewater with LC-HRMS/MS. , 2015, Environmental science & technology.

[21]  Roberto Todeschini,et al.  Handbook of Molecular Descriptors , 2002 .

[22]  Walter Thiel,et al.  Semiempirical quantum–chemical methods , 2014 .

[23]  T. Ternes,et al.  Water analysis: emerging contaminants and current issues. , 2003, Analytical chemistry.

[24]  N. Thomaidis,et al.  Assessment of the Acute Toxicity, Uptake and Biotransformation Potential of Benzotriazoles in Zebrafish ( Danio rerio) Larvae Combining HILIC- with RPLC-HRMS for High-Throughput Identification. , 2018, Environmental science & technology.

[25]  Dong-Sheng Cao,et al.  A new strategy of outlier detection for QSAR/QSPR , 2009, J. Comput. Chem..

[26]  Emma L. Schymanski,et al.  Identifying small molecules via high resolution mass spectrometry: communicating confidence. , 2014, Environmental science & technology.

[27]  Martin Krauss,et al.  LC–high resolution MS in environmental analysis: from target screening to the identification of unknowns , 2010, Analytical and bioanalytical chemistry.

[28]  Alban Arrault,et al.  UPLC–MS retention time prediction: a machine learning approach to metabolite identification in untargeted profiling , 2015, Metabolomics.

[29]  Nikiforos A Alygizakis,et al.  Occurrence and spatial distribution of 158 pharmaceuticals, drugs of abuse and related metabolites in offshore seawater. , 2016, The Science of the total environment.

[30]  R. Beavis,et al.  An Improved Model for Prediction of Retention Times of Tryptic Peptides in Ion Pair Reversed-phase HPLC , 2004, Molecular & Cellular Proteomics.

[31]  M. Zhang,et al.  Biocides in wastewater treatment plants: Mass balance analysis and pollution load estimation. , 2017, Journal of hazardous materials.

[32]  Fiorella Ruggiu,et al.  Quantitative structure-property relationship modeling: a valuable support in high-throughput screening quality control. , 2014, Analytical chemistry.

[33]  Lubertus Bijlsma,et al.  Suspect screening of large numbers of emerging contaminants in environmental waters using artificial neural networks for chromatographic retention time prediction and high resolution mass spectrometry data analysis. , 2015, The Science of the total environment.

[34]  G. Jiang,et al.  Identification and composition of emerging quaternary ammonium compounds in municipal sewage sludge in China. , 2014, Environmental science & technology.

[35]  Leon P Barron,et al.  Prediction of chromatographic retention time in high-resolution anti-doping screening data using artificial neural networks. , 2013, Analytical chemistry.

[36]  A. Ghose,et al.  Prediction of Hydrophobic (Lipophilic) Properties of Small Organic Molecules Using Fragmental Methods: An Analysis of ALOGP and CLOGP Methods , 1998 .

[37]  N. Thomaidis,et al.  Ozonation of ranitidine: Effect of experimental parameters and identification of transformation products. , 2016, The Science of the total environment.

[38]  N. Thomaidis,et al.  Identification of biotransformation products of citalopram formed in activated sludge. , 2016, Water research.

[39]  Martin Krauss,et al.  Performance of combined fragmentation and retention prediction for the identification of organic micropollutants by LC-HRMS , 2018, Analytical and Bioanalytical Chemistry.

[40]  Heinz Singer,et al.  Alleviating the reference standard dilemma using a systematic exact mass suspect screening approach with liquid chromatography-high resolution mass spectrometry. , 2013, Analytical chemistry.

[41]  Marilena E. Dasenaki,et al.  Multianalyte method for the determination of pharmaceuticals in wastewater samples using solid-phase extraction and liquid chromatography–tandem mass spectrometry , 2015, Analytical and Bioanalytical Chemistry.

[42]  Emma L. Schymanski,et al.  Retention projection enables accurate calculation of liquid chromatographic retention times across labs and methods. , 2015, Journal of chromatography. A.

[43]  Delbert Dueck,et al.  Clustering by Passing Messages Between Data Points , 2007, Science.

[44]  Pablo Gago-Ferrero,et al.  Simultaneous determination of 148 pharmaceuticals and illicit drugs in sewage sludge based on ultrasound-assisted extraction and liquid chromatography–tandem mass spectrometry , 2015, Analytical and Bioanalytical Chemistry.

[45]  M. Ibáñez,et al.  UHPLC-QTOF MS screening of pharmaceuticals and their metabolites in treated wastewater samples from Athens. , 2017, Journal of hazardous materials.

[46]  M. Zečević,et al.  Quantitative structure retention relationship modeling in liquid chromatography method for separation of candesartan cilexetil and its degradation products , 2015 .