Machine learning strategies for systems with invariance properties

In many scientific fields, empirical models are employed to facilitate computational simulations of engineering systems. For example, in fluid mechanics, empirical Reynolds stress closures enable computationally efficient Reynolds-averaged Navier-Stokes simulations. Likewise, in solid mechanics, constitutive relations between the stress and strain in a material are required for deformation analysis. Traditional methods for developing and tuning empirical models usually combine physical intuition with simple regression techniques on limited data sets. The rise of high-performance computing has led to a growing availability of high-fidelity simulation data. These data open up the possibility of using machine learning algorithms, such as random forests or neural networks, to develop more accurate and general empirical models. A key question when using data-driven algorithms to develop these empirical models is how domain knowledge should be incorporated into the machine learning process. This paper specifically addresses physical systems that possess symmetry or invariance properties. Two different methods for teaching a machine learning model an invariance property are compared. In the first method, a basis of invariant inputs is constructed, and the machine learning model is trained on this basis, thereby embedding the invariance into the model. In the second method, the algorithm is trained on multiple transformations of the raw input data until the model learns invariance to that transformation. Results are discussed for two case studies: one in turbulence modeling and one in crystal elasticity. It is shown that in both cases embedding the invariance property into the input features yields higher performance at significantly reduced computational training costs.
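The two strategies compared above can be illustrated with a minimal sketch. This is not the paper's implementation. It assumes, for illustration only, that the raw input is a symmetric 3x3 tensor and that the invariance of interest is to rotations. The function names (`random_rotation`, `invariant_features`, `augmented_samples`) are hypothetical, and the trace invariants used here are one common choice of rotation-invariant inputs for a single symmetric tensor.

```python
import numpy as np

def random_rotation(rng):
    """Draw a random 3x3 proper rotation matrix (det = +1) via QR."""
    q, r = np.linalg.qr(rng.standard_normal((3, 3)))
    q = q * np.sign(np.diag(r))      # make the factorization sign-unique
    if np.linalg.det(q) < 0:         # flip one column to exclude reflections
        q[:, 0] = -q[:, 0]
    return q

def invariant_features(s):
    """Method 1 (assumed form): rotation-invariant inputs tr(S), tr(S^2), tr(S^3)."""
    s2 = s @ s
    return np.array([np.trace(s), np.trace(s2), np.trace(s2 @ s)])

def augmented_samples(s, n_copies, rng):
    """Method 2 (assumed form): raw components of rotated copies Q S Q^T."""
    rows = []
    for _ in range(n_copies):
        q = random_rotation(rng)
        rows.append((q @ s @ q.T).ravel())
    return np.stack(rows)

rng = np.random.default_rng(0)
a = rng.standard_normal((3, 3))
s = 0.5 * (a + a.T)                  # illustrative symmetric tensor input

q = random_rotation(rng)
s_rot = q @ s @ q.T

# The invariant features are identical for S and its rotated copy,
# so a model trained on them is rotation invariant by construction.
features_match = np.allclose(invariant_features(s), invariant_features(s_rot))

# The augmented set multiplies the number of training rows per sample,
# and the model must learn the invariance from the data.
augmented = augmented_samples(s, 10, rng)
```

In this sketch the first approach maps each sample to three numbers, while the second turns each sample into ten rows of nine raw components. That growth in training set size is consistent with the abstract's observation that the augmentation approach carries higher training cost.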
