Chemoinformatics at IFP Energies Nouvelles: Applications in the Fields of Energy, Transport, and Environment

The objective of the present paper is to summarize chemoinformatics based research, and more precisely, the development of quantitative structure property relationships performed at IFP Energies nouvelles (IFPEN) during the last decade. A special focus is proposed on research activities performed in the “Thermodynamics and Molecular Simulation” department, i. e. the use of multiscale molecular simulation methods in responses to projects. Molecular simulation techniques can be envisaged to supplement dataset when experimental information lacks, thus the review includes a section dedicated to molecular simulation codes, development of intermolecular potentials, and some of their possible applications. Know‐how and feedback from our experiences in terms of machine learning application for thermophysical property predictions are included in a section dealing with methodological aspects. The generic character of chemoinformatics is emphasized through applications in the fields of energy, transport, and environment, with illustrations for three IFPEN business units: “Transports”, “Energy Resources”, and “Processes”. More precisely, the review focus on different challenges such as the prediction of properties for alternative fuels, the prediction of fuel compatibility with polymeric materials, the prediction of properties for surfactants usable in chemical enhanced oil recovery, and the prediction of guest‐host interactions between gases and nanoporous materials in the frame of carbon dioxide capture or gas separation activities.

[1]  V. Lachet,et al.  Classical force field for hydrofluorocarbon molecular simulations. Application to the study of gas solubility in poly(vinylidene fluoride). , 2015, The journal of physical chemistry. A.

[2]  Hervé Toulhoat,et al.  Prediction of the Cetane Number of Diesel Compounds Using the Quantitative Structure Property Relationship , 2010 .

[3]  A. Simon‐Masseron,et al.  Adsorption of CO(2), CH(4), and N(2) on zeolitic imidazolate frameworks: experiments and simulations. , 2010, Chemistry.

[4]  A. D. Mackie,et al.  Optimized intermolecular potential for aromatic hydrocarbons based on anisotropic united atoms. III. Polyaromatic and naphthenoaromatic hydrocarbons. , 2005, The journal of physical chemistry. B.

[5]  Vijay S. Pande,et al.  Molecular graph convolutions: moving beyond fingerprints , 2016, Journal of Computer-Aided Molecular Design.

[6]  Pascal Mougin,et al.  Prediction of Density and Viscosity of Biofuel Compounds Using Machine Learning Methods , 2012 .

[7]  M. Parmentier,et al.  Simulations of the Impact of Co-injected Gases on CO2 Storage, the SIGARRR Project: First Results on Water-gas Interactions Modeling , 2014 .

[8]  P. Ungerer,et al.  Anisotropic United Atom Model Including the Electrostatic Interactions of Methylbenzenes. I. Thermodynamic and Structural Properties , 2007 .

[9]  Paola Gramatica,et al.  Real External Predictivity of QSAR Models: How To Evaluate It? Comparison of Different Validation Criteria and Proposal of Using the Concordance Correlation Coefficient , 2011, J. Chem. Inf. Model..

[10]  N. Pannacci,et al.  Prediction of Surfactants’ Properties using Multiscale Molecular Modeling Tools: A Review , 2012 .

[11]  W. L. Jorgensen,et al.  Development and Testing of the OPLS All-Atom Force Field on Conformational Energetics and Properties of Organic Liquids , 1996 .

[12]  A. Fuchs,et al.  Monte Carlo simulation of branched alkanes and long chain n -alkanes with anisotropic united atoms intermolecular potential , 2002 .

[13]  Alexandre Varnek,et al.  Substructural fragments: an universal language to encode reactions, molecular and supramolecular structures , 2005, J. Comput. Aided Mol. Des..

[14]  L. Lin,et al.  A concordance correlation coefficient to evaluate reproducibility. , 1989, Biometrics.

[15]  B. Creton,et al.  Prediction of Alternative Gasoline Sorption in a Semicrystalline Poly(ethylene). , 2015, ACS combinatorial science.

[16]  M. Parmentier,et al.  Role of Impurities on CO2 Injection: Experimental and Numerical Simulations of Thermodynamic Properties of Water-salt-gas Mixtures (CO2 + Co-injected Gases) Under Geological Storage Conditions , 2013 .

[17]  B. Creton,et al.  Equivalent Alkane Carbon Number of Live Crude Oil: A Predictive Model Based on Thermodynamics , 2016 .

[18]  Alexander Tropsha,et al.  Best Practices for QSAR Model Development, Validation, and Exploitation , 2010, Molecular informatics.

[19]  A. D. Mackie,et al.  Optimized Intermolecular Potential for Aromatic Hydrocarbons Based on Anisotropic United Atoms. 2. Alkylbenzenes and Styrene , 2004 .

[20]  Philippe Ungerer,et al.  Applications of Molecular Simulation In the Oil and Gas Industry: Monte Carlo Methods , 2005 .

[21]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[22]  Philippe Ungerer,et al.  Thermodynamic behavior of the CO2+SO2 mixture: Experimental and Monte Carlo simulation studies , 2009 .

[23]  P. Ungerer,et al.  Composition Analysis and Viscosity Prediction of Complex Fuel Mixtures Using a Molecular-Based Approach , 2012 .

[24]  P. Malfreyt,et al.  A transferable force field to predict phase equilibria and surface tension of ethers and glycol ethers. , 2011, The journal of physical chemistry. B.

[25]  C. Nieto‐Draghi,et al.  Prediction of thermodynamic properties of adsorbed gases in zeolitic imidazolate frameworks , 2012 .

[26]  A. D. Mackie,et al.  Transferable force field for equilibrium and transport properties in linear and branched monofunctional and multifunctional amines. II. Secondary and tertiary amines. , 2012, The journal of physical chemistry. B.

[27]  Nicolas Jeuland,et al.  Rational Formulation of Alternative Fuels using QSPR Methods: Application to Jet Fuels , 2013 .

[28]  Diego Saldana Miranda,et al.  Prediction of Flash Points for Fuel Mixtures Using Machine Learning and a Novel Equation , 2013 .

[29]  Diego Saldana Miranda,et al.  On the rational formulation of alternative fuels: melting point and net heat of combustion predictions for fuel compounds using machine learning methods , 2013, SAR and QSAR in environmental research.

[30]  A. D. Mackie,et al.  Optimized Intermolecular Potential for Aromatic Hydrocarbons Based on Anisotropic United Atoms. 1. Benzene , 2004 .

[31]  Igor I. Baskin,et al.  Machine Learning Methods for Property Prediction in Chemoinformatics: Quo Vadis? , 2012, J. Chem. Inf. Model..

[32]  Tomoko Niwa,et al.  Using General Regression and Probabilistic Neural Networks To Predict Human Intestinal Absorption with Topological Descriptors Derived from Two-Dimensional Chemical Structures , 2003, J. Chem. Inf. Comput. Sci..

[33]  G. Kontogeorgis,et al.  Characterization Scheme for Property Prediction of Fluid Fractions Originating from Biomass , 2015 .

[34]  David Weininger,et al.  SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules , 1988, J. Chem. Inf. Comput. Sci..

[35]  Philippe Ungerer,et al.  New optimization method for intermolecular potentials: Optimization of a new anisotropic united atoms potential for olefins: Prediction of equilibrium properties , 2003 .

[36]  P. Ungerer,et al.  Overview of MedeA®-GIBBS capabilities for thermodynamic property calculation and VLE behaviour description of pure compounds and mixtures: application to polar compounds generated from ligno-cellulosic biomass , 2013 .

[37]  V. Lachet,et al.  Monte Carlo simulations of mixtures involving ketones and aldehydes by a direct bubble pressure calculation. , 2010, The journal of physical chemistry. B.

[38]  P. Ungerer,et al.  Prediction of thermodynamic properties of heavy hydrocarbons by Monte Carlo simulation , 2004 .

[39]  Paola Gramatica,et al.  Principles of QSAR models validation: internal and external , 2007 .

[40]  A. D. Mackie,et al.  Transferable force field for equilibrium and transport properties in linear, branched, and bifunctional amines I. Primary amines. , 2011, The journal of physical chemistry. B.

[41]  P. Raybaud,et al.  A QSPR Investigation of Thermal Stability of [Al(CH3)O]n Oligomers in Methylaluminoxane Solution: The Identification of a Geometry-Based Descriptor , 2012 .

[42]  C. Nieto‐Draghi,et al.  Molecular simulation of zeolite flexibility , 2014 .

[43]  N. Pannacci,et al.  Surface photografting of acrylic acid on poly(dimethylsiloxane). Experimental and dissipative particle dynamics studies. , 2015, Langmuir : the ACS journal of surfaces and colloids.

[44]  A. D. Mackie,et al.  Vapour-Liquid Phase Equilibria Predictions of Methane–Alkane Mixtures by Monte Carlo Simulation , 1997 .

[45]  G. Kontogeorgis,et al.  GC-PPC-SAFT Equation of State for VLE and LLE of Hydrocarbons and Oxygenated Compounds. Sensitivity Analysis , 2013 .

[46]  Paola Gramatica,et al.  Real External Predictivity of QSAR Models. Part 2. New Intercomparable Thresholds for Different Validation Criteria and the Need for Scatter Plot Inspection , 2012, J. Chem. Inf. Model..

[47]  P. Rotureau,et al.  A General Guidebook for the Theoretical Prediction of Physicochemical Properties of Chemicals for Regulatory Purposes. , 2015, Chemical reviews.

[48]  A. Ghoufi,et al.  Monte Carlo simulations of the pressure dependence of the water-acid gas interfacial tensions. , 2009, The journal of physical chemistry. B.

[49]  V. Lachet,et al.  Transferable force field for alcohols and polyalcohols. , 2009, The journal of physical chemistry. B.

[50]  Philippe Ungerer,et al.  Anisotropic united atom model including the electrostatic interactions of benzene. , 2007, The journal of physical chemistry. B.

[51]  G. Kontogeorgis,et al.  Improving GC-PPC-SAFT equation of state for LLE of hydrocarbons and oxygenated compounds with water , 2014 .

[52]  Carlos Nieto-Draghi,et al.  A Transferable Force Field for Primary, Secondary, and Tertiary Alkanolamines. , 2013, Journal of chemical theory and computation.

[53]  Philippe Ungerer,et al.  Anisotropic united atom model including the electrostatic interactions of methylbenzenes. II. Transport properties , 2007 .

[54]  R. Lathe Phd by thesis , 1988, Nature.

[55]  B. Creton,et al.  Impact of Associated Gases on Equilibrium and Transport Properties of a $$\mathrm{CO}_{2}$$CO2 Stream: Molecular Simulation and Experimental Studies , 2014 .

[56]  N. Pannacci,et al.  Interfacial tension in oil-water-surfactant systems: on the role of intra-molecular forces on interfacial tension values using DPD simulations. , 2013, The Journal of chemical physics.

[57]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[58]  Suhani J. Patel,et al.  Review of Existing QSAR/QSPR Models Developed for Properties Used in Hazardous Chemicals Classification System , 2012 .

[59]  Michael O'Keeffe,et al.  Synthesis, structure, and carbon dioxide capture properties of zeolitic imidazolate frameworks. , 2010, Accounts of chemical research.

[60]  A Goulon,et al.  Predicting activities without computing descriptors: graph machines for QSAR , 2007, SAR and QSAR in environmental research.

[61]  R. Privat,et al.  Phase equilibrium of CCS mixtures: Equation of state modeling and Monte Carlo simulation , 2017 .

[62]  A. Ghoufi,et al.  Optimisation of the dynamical behaviour of the anisotropic united atom model of branched alkanes: application to the molecular simulation of fuel gasoline , 2008 .

[63]  Ludivine Pidol,et al.  Flash Point and Cetane Number Predictions for Fuel Compounds Using Quantitative Structure Property Relationship (QSPR) Methods , 2011 .

[64]  A. D. Mackie,et al.  Transport coefficients and dynamic properties of hydrogen sulfide from molecular simulation. , 2005, The Journal of chemical physics.

[65]  Philippe Ungerer,et al.  Thermodynamic study of binary systems containing sulphur dioxide: Measurements and molecular modelling , 2011 .

[66]  A. Boutin,et al.  Molecular simulation applied to fluid properties in the oil and gas industry , 2007 .

[67]  Raimondas Galvelis,et al.  Impact of functionalized linkers on the energy landscape of ZIFs , 2013 .

[68]  Bernard Rousseau,et al.  Viscosity of the 1-ethyl-3-methylimidazolium bis(trifluoromethylsulfonyl)imide ionic liquid from equilibrium and nonequilibrium molecular dynamics. , 2010, Physical chemistry chemical physics : PCCP.

[69]  Svetoslav H. Slavov,et al.  Quantitative correlation of physical and chemical properties with chemical structure: utility for prediction. , 2010, Chemical reviews.

[70]  Philippe Ungerer,et al.  Optimization of the anisotropic united atoms intermolecular potential for n-alkanes: improvement of transport properties. , 2006, The Journal of chemical physics.

[71]  A. Varnek,et al.  Prediction of Optimal Salinities for Surfactant Formulations Using a Quantitative Structure-Property Relationships Approach , 2015 .

[72]  Morten Hammer,et al.  Equilibrium and transport properties of CO2 + N2O and CO2 + NO mixtures: Molecular simulation and equation of state modelling study , 2012 .

[73]  N. Ferrando,et al.  Prediction of the PC-SAFT associating parameters by molecular simulation. , 2012, The journal of physical chemistry. B.

[74]  Bernard Rousseau,et al.  Influence of the adjustable parameters of the DPD on the global and local dynamics of a polymer melt , 2007 .

[75]  Supratik Kar,et al.  On a simple approach for determining applicability domain of QSAR models , 2015 .