Compressive sensing as a paradigm for building physics models

The widely accepted intuition that the important properties of solids are determined by a few key variables underpins many methods in physics. Though this reductionist paradigm is applicable in many physical problems, its utility can be limited because the intuition for identifying the key variables often does not exist or is difficult to develop. Machine learning algorithms (genetic programming, neural networks, Bayesian methods, etc.) attempt to eliminate the ap riorineed for such intuition but often do so with increased computational burden and human time. A recently developed technique in the field of signal processing, compressive sensing (CS), provides a simple, general, and efficient way of finding the key descriptive variables. CS is a powerful paradigm for model building; we show that its models are more physical and predict more accurately than current state-of-the-art approaches and can be constructed at a fraction of the computational cost and user effort.

[1]  A. Miedema,et al.  On the heat of formation of solid alloys , 1975 .

[2]  Alex Zunger,et al.  Systematization of the stable crystal structure of all AB-type binary compounds: A pseudopotential orbital-radii approach , 1980 .

[3]  P. Villars,et al.  A three-dimensional structural stability diagram for 1011 binary AB2 intermetallic compounds: II , 1983 .

[4]  J. Connolly,et al.  Density-functional theory applied to phase transformations in transition-metal alloys , 1983 .

[5]  D. G. Pettifor,et al.  A chemical scale for crystal-structure maps , 1984 .

[6]  F. Ducastelle,et al.  Generalized cluster description of multicomponent systems , 1984 .

[7]  D. Pettifor,et al.  The structures of binary compounds. I. Phenomenological structure maps , 1986 .

[8]  S. Froyen,et al.  Brillouin-zone integration by Fourier quadrature: Special points for superlattice and supercell calculations. , 1989, Physical review. B, Condensed matter.

[9]  Ferreira,et al.  Efficient cluster expansion for substitutional systems. , 1992, Physical review. B, Condensed matter.

[10]  D. Fontaine Cluster Approach to Order-Disorder Transformations in Alloys , 1994 .

[11]  Alex Zunger,et al.  First-Principles Statistical Mechanics of Semiconductor Alloys and Intermetallic Compounds , 1994 .

[12]  Blöchl,et al.  Projector augmented-wave method. , 1994, Physical review. B, Condensed matter.

[13]  P. Feschotte,et al.  A revision of the binary system AgPt , 1996 .

[14]  Burke,et al.  Generalized Gradient Approximation Made Simple. , 1996, Physical review letters.

[15]  G. Kresse,et al.  Efficiency of ab-initio total energy calculations for metals and semiconductors using a plane-wave basis set , 1996 .

[16]  Michael A. Saunders,et al.  Atomic Decomposition by Basis Pursuit , 1998, SIAM J. Sci. Comput..

[17]  G. Kresse,et al.  From ultrasoft pseudopotentials to the projector augmented-wave method , 1999 .

[18]  I. Johnstone On the distribution of the largest eigenvalue in principal components analysis , 2001 .

[19]  Xiaoming Huo,et al.  Uncertainty principles and ideal atomic decomposition , 2001, IEEE Trans. Inf. Theory.

[20]  A. van de Walle,et al.  The Alloy Theoretic Automated Toolkit: A User Guide , 2002 .

[21]  Nikolai A Zarkevich,et al.  Reliable first-principles alloy thermodynamics via truncated cluster expansions. , 2004, Physical review letters.

[22]  Gus L. W. Hart,et al.  Using genetic algorithms to map first-principles results to model Hamiltonians: Application to the generalized Ising model for alloys , 2005 .

[23]  Gevorg Grigoryan,et al.  Coarse-graining protein energetics in sequence variables. , 2005, Physical review letters.

[24]  E. Candès,et al.  Stable signal recovery from incomplete and inaccurate measurements , 2005, math/0503066.

[25]  Stephen P. Boyd,et al.  Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[26]  Emmanuel J. Candès,et al.  Robust uncertainty principles: exact signal reconstruction from highly incomplete frequency information , 2004, IEEE Transactions on Information Theory.

[27]  Ralf Drautz,et al.  Obtaining cluster expansion coefficients in ab initio thermodynamics of multicomponent lattice-gas systems , 2006 .

[28]  Gerbrand Ceder,et al.  Predicting crystal structure by merging data mining with quantum mechanics , 2006, Nature materials.

[29]  E. Candès,et al.  Sparsity and incoherence in compressive sampling , 2006, math/0611957.

[30]  Ralf Drautz,et al.  Cluster expansions in multicomponent systems: precise expansions from noisy databases , 2007, Journal of physics. Condensed matter : an Institute of Physics journal.

[31]  A. Dick,et al.  Free energy of bcc iron: Integrated ab initio derivation of vibrational, electronic, and magnetic contributions , 2008 .

[32]  Wotao Yin,et al.  Bregman Iterative Algorithms for (cid:2) 1 -Minimization with Applications to Compressed Sensing ∗ , 2008 .

[33]  Gus L. W. Hart,et al.  Generating derivative structures: Algorithm and applications , 2008, 0804.3544.

[34]  A. Jansen,et al.  Bayesian approach to the calculation of lateral interactions: NO/Rh(111) , 2008 .

[35]  E.J. Candes,et al.  An Introduction To Compressive Sampling , 2008, IEEE Signal Processing Magazine.

[36]  S. Woodley,et al.  Crystal structure prediction from first principles. , 2008, Nature materials.

[37]  R. Drautz,et al.  Theory of size mismatched alloy systems: many-body Kanzaki forces , 2008 .

[38]  R. Forcade,et al.  Generating derivative structures from multilattices: Algorithm and application to hcp alloys , 2009 .

[39]  Tom Goldstein,et al.  The Split Bregman Method for L1-Regularized Problems , 2009, SIAM J. Imaging Sci..

[40]  Gevorg Grigoryan,et al.  Design of protein-interaction specificity affords selective bZIP-binding peptides , 2009, Nature.

[41]  Damien Stehlé,et al.  Low-dimensional lattice basis reduction revisited , 2004, TALG.

[42]  R. Rosenfeld Nature , 2009, Otolaryngology--head and neck surgery : official journal of American Academy of Otolaryngology-Head and Neck Surgery.

[43]  Thomas Bligaard,et al.  Virtual materials design using databases of calculated materials properties , 2009 .

[44]  Atsuto Seko,et al.  Cluster expansion method for multicomponent systems based on optimal selection of structures for density-functional theory calculations , 2009 .

[45]  Tim Mueller,et al.  Bayesian approach to cluster expansions , 2009 .

[46]  R. Forcade,et al.  UNCLE: a code for constructing cluster expansions for arbitrary lattices with minimal user-input , 2009 .

[47]  Axel van de Walle,et al.  Multicomponent multisublattice alloys, nonconfigurational entropy and other additions to the Alloy Theoretic Automated Toolkit , 2009, 0906.1608.

[48]  Long-wavelength elastic interactions in complex crystals. , 2010, Physical review letters.

[49]  Axel van de Walle,et al.  Building effective models from sparse but precise data: Application to an alloy cluster expansion model , 2009, 0908.0659.

[50]  Tim Mueller,et al.  Exact expressions for structure selection in cluster expansions , 2010 .

[51]  J. C. Schön,et al.  Predicting solid compounds via global exploration of the energy landscape of solids on the ab initio level without recourse to experimental information , 2010 .

[52]  Reciprocal-space cluster expansions for complex alloys with long-range interactions , 2011 .

[53]  Stefano Curtarolo,et al.  High-throughput combinatorial database of electronic band structures for inorganic scintillator materials. , 2011, ACS combinatorial science.

[54]  Mohammed AlQuraishi,et al.  Direct inference of protein–DNA interactions using compressed sensing methods , 2011, Proceedings of the National Academy of Sciences.

[55]  Lance J. Nelson,et al.  Ground-state characterizations of systems predicted to exhibit L11 or L13 crystal structures , 2012 .

[56]  New insights on strain energies in hexagonal systems , 2012 .