Analyzing Particle Systems for Machine Learning and Data Visualization with freud

The freud Python library analyzes particle data output from molecular dynamics simulations. The library’s design and its variety of highperformance methods make it a powerful tool for many modern applications. In particular, freud can be used as part of the data generation pipeline for machine learning (ML) algorithms for analyzing particle simulations, and it can be easily integrated with various simulation visualization tools for simultaneous visualization and real-time analysis. Here, we present numerous examples both of using freud to analyze nano-scale particle systems by coupling traditional simulational analyses to machine learning libraries and of visualizing per-particle quantities calculated by freud analysis methods. We include code and examples of this visualization, showing that in general the introduction of freud into existing ML and visualization workflows is smooth and unintrusive. We demonstrate that among Python packages used in the computational molecular sciences, freud offers a unique set of analysis methods with efficient computations and seamless coupling into powerful data analysis pipelines.

[1]  Greg van Anders,et al.  Pressure-tunable photonic band gaps in an entropic colloidal crystal , 2018, Physical Review Materials.

[2]  Sharon C. Glotzer,et al.  freud: A Software Suite for High Throughput Analysis of Particle Simulation Data , 2019, Comput. Phys. Commun..

[3]  Sharon C. Glotzer,et al.  Shape and symmetry determine two-dimensional melting transitions of hard regular polygons , 2016, 1606.00687.

[4]  S. Glotzer,et al.  Anisotropy of building blocks and their assembly into complex structures. , 2007, Nature materials.

[5]  Sharon C Glotzer,et al.  Understanding shape entropy through local dense packing , 2013, Proceedings of the National Academy of Sciences.

[6]  Sharon C Glotzer,et al.  Entropically patchy particles: engineering valence through shape entropy. , 2013, ACS nano.

[7]  Joshua A. Anderson,et al.  General purpose molecular dynamics simulations fully implemented on graphics processing units , 2008, J. Comput. Phys..

[8]  Michael J. Campolongo,et al.  Building plasmonic nanostructures with DNA. , 2011, Nature nanotechnology.

[9]  Stefan Behnel,et al.  Cython: The Best of Both Worlds , 2011, Computing in Science & Engineering.

[10]  Matteo Salvalaglio,et al.  DeepIce: A Deep Neural Network Approach To Identify Ice and Water Molecules , 2019, J. Chem. Inf. Model..

[11]  S. Glotzer,et al.  Using depletion to control colloidal crystal assemblies of hard cuboctahedra. , 2015, Soft matter.

[12]  Steve Plimpton,et al.  Fast parallel algorithms for short-range molecular dynamics , 1993 .

[13]  A. Stukowski Visualization and analysis of atomistic simulation data with OVITO–the Open Visualization Tool , 2009 .

[14]  Leland McInnes,et al.  UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction , 2018, ArXiv.

[15]  Carl S Adorf,et al.  Inverse design of simple pair potentials for the self-assembly of complex structures. , 2017, The Journal of chemical physics.

[16]  S. Glotzer,et al.  Hierarchical self-assembly of hard cube derivatives. , 2019, Soft matter.

[17]  Sharon C Glotzer,et al.  Machine learning for crystal identification and discovery , 2017, 1710.09861.

[18]  D. van der Spoel,et al.  GROMACS: A message-passing parallel molecular dynamics implementation , 1995 .

[19]  Oliver Beckstein,et al.  MDAnalysis: A toolkit for the analysis of molecular dynamics simulations , 2011, J. Comput. Chem..

[20]  Eric S. Harper,et al.  Shape allophiles improve entropic assembly. , 2015, Soft matter.

[21]  P. Steinhardt,et al.  Bond-orientational order in liquids and glasses , 1983 .

[22]  Martín Abadi,et al.  TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems , 2016, ArXiv.

[23]  Thomas J Lane,et al.  MDTraj: a modern, open library for the analysis of molecular dynamics trajectories , 2014, bioRxiv.

[24]  Travis E. Oliphant,et al.  Guide to NumPy , 2015 .

[25]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[26]  Christoph Dellago,et al.  Accurate determination of crystal structures based on averaged local bond order parameters. , 2008, The Journal of chemical physics.

[27]  Eric Jones,et al.  SciPy: Open Source Scientific Tools for Python , 2001 .

[28]  P. Damasceno,et al.  Predictive Self-Assembly of Polyhedra into Complex Structures , 2012, Science.

[29]  S. Glotzer,et al.  Identity crisis in alchemical space drives the entropic colloidal glass transition , 2019, Nature Communications.

[30]  Pak Lui,et al.  Strong scaling of general-purpose molecular dynamics simulations on GPUs , 2014, Comput. Phys. Commun..

[31]  Barrett R. Morrow,et al.  Supercharging enables organized assembly of synthetic biomolecules , 2019, Nature Chemistry.

[32]  Andrea J. Liu,et al.  A structural approach to relaxation in glassy liquids , 2015, Nature Physics.