Machine learning for predicting thermodynamic properties of pure fluids and their mixtures

Abstract Establishing a reliable equation of state for largely non-ideal or multi-component liquid systems is challenging because the complex effects of molecular configurations and/or interactions on the thermodynamic properties must generally be taken into account. In this regard, machine learning holds great potential for directly learning the thermodynamic mappings from existing data, thereby bypassing the use of equations of state. The present study outlines a general machine learning framework based on high-efficiency support vector regression for predicting the thermodynamic properties of pure fluids and their mixtures. The proposed framework is adopted in conjunction with training data obtained from a high-fidelity database to successfully predict the thermodynamic properties of three common pure fluids. The predictions demonstrate extremely low mean square errors. Moreover, little loss in the prediction accuracy is obtained for ternary mixtures of the pure fluids at the cost of a modest increase in the volume of training data provided by state-of-the-art molecular dynamics simulations. Our results demonstrate the promising potential of machine learning for building accurate thermodynamic mappings of pure fluids and their mixtures. The proposed methodology may pave the way in the future for the rapid exploration of novel or complex systems with potentially exceptional thermodynamic properties.

[1]  Liejin Guo,et al.  Hydrogen production from coal gasification in supercritical water with a continuous flowing system , 2010 .

[2]  Masoud Rokni,et al.  Thermodynamic Performance Study of Biomass Gasification, Solid Oxide Fuel Cell and Micro Gas Turbine Hybrid Systems , 2010 .

[3]  S. Lvov,et al.  Phase behavior of the CO2–H2O system at temperatures of 273–623 K and pressures of 0.1–200 MPa using Peng-Robinson-Stryjek-Vera equation of state with a modified Wong-Sandler mixing rule: An extension to the CO2–CH4–H2O system , 2016 .

[4]  Xiao-Dong Wang,et al.  Binary interaction parameter kij for calculating the second cross-virial coefficients of mixtures , 2007 .

[5]  Kristof T. Schütt,et al.  How to represent crystal structures for machine learning: Towards fast prediction of electronic properties , 2013, 1307.1266.

[6]  B. Cao,et al.  A molecular dynamics simulation study of PVT properties for H2O/H2/CO2 mixtures in near-critical and supercritical regions of water , 2018, International Journal of Hydrogen Energy.

[7]  Chongli Zhong,et al.  Molecular simulation of adsorption and diffusion of hydrogen in metal-organic frameworks. , 2005, The journal of physical chemistry. B.

[8]  W. L. Jorgensen,et al.  Comparison of simple potential functions for simulating liquid water , 1983 .

[9]  W. Cai,et al.  A Generative Model for Inverse Design of Metamaterials , 2018, Nano letters.

[10]  Klaus-Robert Müller,et al.  Machine learning of accurate energy-conserving molecular force fields , 2016, Science Advances.

[11]  Arthur E. Hoerl,et al.  Ridge Regression: Biased Estimation for Nonorthogonal Problems , 2000, Technometrics.

[12]  Simón Reif-Acherman,et al.  Peng-Robinson equation of state: 40 years through cubics , 2017 .

[13]  J. Vera,et al.  Understanding cubic equations of state: A search for the hidden clues of their success , 2015 .

[14]  Chiho Kim,et al.  Machine learning in materials informatics: recent applications and prospects , 2017, npj Computational Materials.

[15]  Kim Choon Ng,et al.  Thermodynamic modelling of a solid state thermoelectric cooling device: Temperature–entropy analysis , 2006 .

[16]  Alexander J. Smola,et al.  Learning with Kernels: support vector machines, regularization, optimization, and beyond , 2001, Adaptive computation and machine learning series.

[17]  Steve Plimpton,et al.  Fast parallel algorithms for short-range molecular dynamics , 1993 .

[18]  Barbara Sherwood Lollar,et al.  Solubility trapping in formation water as dominant CO2 sink in natural gas fields , 2009, Nature.

[19]  Xiao Yang,et al.  Energy optimization and analysis modeling based on extreme learning machine integrated index decomposition analysis: Application to complex chemical processes , 2017 .

[20]  Felix A Faber,et al.  Crystal structure representations for machine learning models of formation energies , 2015, 1503.07406.

[21]  Hoover,et al.  Canonical dynamics: Equilibrium phase-space distributions. , 1985, Physical review. A, General physics.

[22]  K. Müller,et al.  Fast and accurate modeling of molecular atomization energies with machine learning. , 2011, Physical review letters.

[23]  Y. Duan,et al.  Gaseous pvTx Properties of Mixtures of Carbon Dioxide and Propane with the Burnett Isochoric Method , 2010 .

[24]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[25]  N. Arndt,et al.  Atmospheric oxygenation caused by a change in volcanic degassing pressure , 2011, Nature.

[26]  Anthony Widjaja,et al.  Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond , 2003, IEEE Transactions on Neural Networks.

[27]  J. Martí,et al.  Density and temperature effects on the orientational and dielectric properties of supercritical water. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[28]  Atsuto Seko,et al.  Machine learning with systematic density-functional theory calculations: Application to melting temperatures of single- and binary-component solids , 2013, 1310.1546.

[29]  Elisabeth Badens,et al.  Supercritical CO2 extraction of neutral lipids from microalgae: Experiments and modelling , 2013 .

[30]  Kwong H. Yung,et al.  Carbon Dioxide's Liquid-Vapor Coexistence Curve And Critical Properties as Predicted by a Simple Molecular Model , 1995 .

[31]  Klaus-Robert Müller,et al.  Assessment and Validation of Machine Learning Methods for Predicting Molecular Atomization Energies. , 2013, Journal of chemical theory and computation.

[32]  Zhenhao Duan,et al.  An optimized molecular potential for carbon dioxide. , 2005, The Journal of chemical physics.

[33]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[34]  Zhenhao Duan,et al.  Equation of state of the H2O, CO2, and H2O-CO2 systems up to 10 GPa and 2573.15 K: Molecular dynamics simulations with ab initio potential surface , 2006 .

[35]  John H. Weare,et al.  A general equation of state for supercritical fluid mixtures and molecular dynamics simulation of mixture PVTX properties , 1996 .

[36]  Yanbin Yuan,et al.  Wind power prediction using hybrid autoregressive fractionally integrated moving average and least square support vector machine , 2017 .

[37]  Chih-Jen Lin,et al.  Training v-Support Vector Regression: Theory and Algorithms , 2002, Neural Computation.

[38]  Hiroshi Yamaguchi,et al.  Theoretical analysis of a thermodynamic cycle for power and heat production using supercritical carbon dioxide , 2007 .

[39]  Honglai Liu,et al.  Thermodynamic properties and vapor–liquid equilibria of associating fluids, Peng–Robinson equation of state coupled with shield-sticky model , 2012 .

[40]  Wei D. Lu,et al.  Experimental Demonstration of Feature Extraction and Dimensionality Reduction Using Memristor Networks. , 2017, Nano letters.

[41]  Paul Raccuglia,et al.  Machine-learning-assisted materials discovery using failed experiments , 2016, Nature.

[42]  Li Li,et al.  Bypassing the Kohn-Sham equations with machine learning , 2016, Nature Communications.

[43]  Rampi Ramprasad,et al.  A universal strategy for the creation of machine learning-based atomistic force fields , 2017, npj Computational Materials.

[44]  K. Müller,et al.  Machine Learning Predictions of Molecular Properties: Accurate Many-Body Potentials and Nonlocality in Chemical Space , 2015, The journal of physical chemistry letters.

[45]  D. Peng,et al.  A New Two-Constant Equation of State , 1976 .

[46]  Volker L. Deringer,et al.  Machine learning based interatomic potential for amorphous carbon , 2016, 1611.03277.

[47]  C. E. Hori,et al.  Evaluation of different contribution methods over the performance of Peng–Robinson and CPA equation of state in the correlation of VLE of triglycerides, fatty esters and glycerol + CO2 and alcohol , 2014 .

[48]  Bernhard Schölkopf,et al.  New Support Vector Algorithms , 2000, Neural Computation.

[49]  Paul A. Midgley,et al.  Multicomponent Signal Unmixing from Nanoheterostructures: Overcoming the Traditional Challenges of Nanoscale X-ray Analysis via Machine Learning , 2015, Nano letters.

[50]  H. Nishiumi,et al.  Generalization of binary interaction parameters of Peng-Robinson equation of state for systems containing hydrogen , 1990 .

[51]  G. Soave Equilibrium constants from a modified Redlich-Kwong equation of state , 1972 .

[52]  T. Buonassisi,et al.  Machine Learning for Novel Thermal-Materials Discovery: Early Successes, Opportunities, and Challenges. , 2019, 1901.05801.

[53]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.