Suspect screening of large numbers of emerging contaminants in environmental waters using artificial neural networks for chromatographic retention time prediction and high resolution mass spectrometry data analysis.

The recent development of broad-scope high resolution mass spectrometry (HRMS) screening methods has resulted in a much improved capability for new compound identification in environmental samples. However, positive identifications at the ng/L concentration level rely on analytical reference standards for chromatographic retention time (tR) and mass spectral comparisons. Chromatographic tR prediction can play a role in increasing confidence in suspect screening efforts for new compounds in the environment, especially when standards are not available, but reliable methods are lacking. The current work focuses on the development of artificial neural networks (ANNs) for tR prediction in gradient reversed-phase liquid chromatography and applied along with HRMS data to suspect screening of wastewater and environmental surface water samples. Based on a compound tR dataset of >500 compounds, an optimized 4-layer back-propagation multi-layer perceptron model enabled predictions for 85% of all compounds to within 2min of their measured tR for training (n=344) and verification (n=100) datasets. To evaluate the ANN ability for generalization to new data, the model was further tested using 100 randomly selected compounds and revealed 95% prediction accuracy within the 2-minute elution interval. Given the increasing concern on the presence of drug metabolites and other transformation products (TPs) in the aquatic environment, the model was applied along with HRMS data for preliminary identification of pharmaceutically-related compounds in real samples. Examples of compounds where reference standards were subsequently acquired and later confirmed are also presented. To our knowledge, this work presents for the first time, the successful application of an accurate retention time predictor and HRMS data-mining using the largest number of compounds to preliminarily identify new or emerging contaminants in wastewater and surface waters.

[1]  K. Kümmerer Pharmaceuticals in the Environment , 2001 .

[2]  R. Kaliszan,et al.  Prediction of gradient retention from the linear solvent strength (LSS) model, quantitative structure‐retention relationships (QSRR), and artificial neural networks (ANN) , 2003 .

[3]  A. Hogenboom,et al.  Accurate mass screening and identification of emerging contaminants in environmental samples by liquid chromatography-hybrid linear ion trap Orbitrap mass spectrometry. , 2009, Journal of chromatography. A.

[4]  Martin Reinhard,et al.  Emerging contaminants of public health significance as water quality indicator compounds in the urban water cycle. , 2014, Environment international.

[5]  Lubertus Bijlsma,et al.  Rapid wide-scope screening of drugs of abuse, prescription drugs with potential for abuse and their metabolites in influent and effluent urban wastewater by ultrahigh pressure liquid chromatography-quadrupole-time-of-flight-mass spectrometry. , 2011, Analytica chimica acta.

[6]  Xingguo Chen,et al.  Quantitative structure-retention relationships for mycotoxins and fungal metabolites in LC-MS/MS. , 2009, Journal of separation science.

[7]  L. Snyder,et al.  Column selectivity in reversed-phase liquid chromatography III. The physico-chemical basis of selectivity. , 2002, Journal of chromatography. A.

[8]  Tania Portolés,et al.  Advancing towards universal screening for organic pollutants in waters. , 2015, Journal of hazardous materials.

[9]  A. Tsantili-Kakoulidou,et al.  Quantitative Structure–Retention Relationships as Useful Tool to Characterize Chromatographic Systems and Their Potential to Simulate Biological Processes , 2013, Chromatographia.

[10]  René P Schwarzenbach,et al.  Identification of transformation products of organic contaminants in natural waters by computer-aided prediction and high-resolution mass spectrometry. , 2009, Environmental science & technology.

[11]  Philip H Howard,et al.  Identifying new persistent and bioaccumulative organics among chemicals in commerce. III: byproducts, impurities, and transformation products. , 2013, Environmental science & technology.

[12]  Igor V. Tetko,et al.  Virtual Computational Chemistry Laboratory – Design and Description , 2005, J. Comput. Aided Mol. Des..

[13]  Martin Krauss,et al.  LC–high resolution MS in environmental analysis: from target screening to the identification of unknowns , 2010, Analytical and bioanalytical chemistry.

[14]  T. T. ter Laak,et al.  Prediction of concentration levels of metformin and other high consumption pharmaceuticals in wastewater and regional surface water based on sales data. , 2013, The Science of the total environment.

[15]  Petar Žuvela,et al.  Development of Gradient Retention Model in Ion Chromatography. Part I: Conventional QSRR Approach , 2014, Chromatographia.

[16]  K. Héberger Quantitative structure-(chromatographic) retention relationships. , 2007, Journal of chromatography. A.

[17]  T. Croley,et al.  The Chromatographic Role in High Resolution Mass Spectrometry for Non-Targeted Analysis , 2012, Journal of the American Society for Mass Spectrometry.

[18]  H. Sakamoto,et al.  Behavior of Pharmaceuticals in Waste Water Treatment Plant in Japan , 2011, Bulletin of environmental contamination and toxicology.

[19]  Jukka Pellinen,et al.  Critical evaluation of screening techniques for emerging environmental contaminants based on accurate mass measurements with time-of-flight mass spectrometry. , 2012, Journal of mass spectrometry : JMS.

[20]  Félix Hernández,et al.  Multi-class determination of around 50 pharmaceuticals, including 26 antibiotics, in environmental and wastewater samples by ultra-high performance liquid chromatography-tandem mass spectrometry. , 2011, Journal of chromatography. A.

[21]  Emma L. Schymanski,et al.  Identifying small molecules via high resolution mass spectrometry: communicating confidence. , 2014, Environmental science & technology.

[22]  Imma Ferrer,et al.  Analysis of 100 pharmaceuticals and their degradates in water samples by liquid chromatography/quadrupole time-of-flight mass spectrometry. , 2012, Journal of chromatography. A.

[23]  Leon P Barron,et al.  Artificial neural network modelling of pharmaceutical residue retention times in wastewater extracts using gradient liquid chromatography-high resolution mass spectrometry data. , 2015, Journal of chromatography. A.

[24]  J. Namieśnik,et al.  The Current State-of-the-Art in the Determination of Pharmaceutical Residues in Environmental Matrices Using Hyphenated Techniques , 2014, Critical reviews in analytical chemistry.

[25]  M. Hirai,et al.  MassBank: a public repository for sharing mass spectral data for life sciences. , 2010, Journal of mass spectrometry : JMS.

[26]  J. V. Sancho,et al.  Simultaneous determination of acidic, neutral and basic pharmaceuticals in urban wastewater by ultra high-pressure liquid chromatography-tandem mass spectrometry. , 2010, Journal of chromatography. A.

[27]  M. Ibáñez,et al.  Quadrupole-time-of-flight mass spectrometry screening for synthetic cannabinoids in herbal blends. , 2013, Journal of mass spectrometry : JMS.

[28]  M. Mezcua,et al.  Rapid automated screening, identification and quantification of organic micro-contaminants and their main transformation products in wastewater and river waters using liquid chromatography-quadrupole-time-of-flight mass spectrometry with an accurate-mass database. , 2010, Journal of chromatography. A.

[29]  Brett Paull,et al.  Predicting sorption of pharmaceuticals and personal care products onto soil and digested sludge using artificial neural networks. , 2009, The Analyst.

[30]  Janusz Pawliszyn,et al.  Quantitative structure-retention relationships models for prediction of high performance liquid chromatography retention time of small molecules: endogenous metabolites and banned compounds. , 2013, Analytica chimica acta.

[31]  B. Buszewski,et al.  Development of Gradient Retention Model in Ion Chromatography. Part II: Artificial Intelligence QSRR Approach , 2014, Chromatographia.

[32]  Lubertus Bijlsma,et al.  LC-QTOF MS screening of more than 1,000 licit and illicit drugs and their metabolites in wastewater and surface waters from the area of Bogotá, Colombia , 2015, Analytical and Bioanalytical Chemistry.

[33]  Lubertus Bijlsma,et al.  Critical evaluation of a simple retention time predictor based on LogKow as a complementary tool in the identification of emerging contaminants in water. , 2015, Talanta.

[34]  D. Calamari,et al.  Pharmaceuticals in the Environment in Italy: Causes, Occurrence, Effects and Control , 2006, Environmental science and pollution research international.

[35]  A. Fernández-Alba,et al.  Simultaneous screening of targeted and non-targeted contaminants using an LC-QTOF-MS system and automated MS/MS library searching. , 2014, Journal of mass spectrometry : JMS.

[36]  Ana Agüera,et al.  New trends in the analytical determination of emerging contaminants and their transformation products in environmental waters , 2013, Environmental Science and Pollution Research.

[37]  Leon P Barron,et al.  Prediction of chromatographic retention time in high-resolution anti-doping screening data using artificial neural networks. , 2013, Analytical chemistry.