Many Resident Task Computing in Support of Dynamic Ensemble Computations

Modern agent-based models (ABMs) that can simulate large populations are increasingly used to answer important questions in a variety of topic areas. For these models to serve as useful electronic laboratories, they require a battery of experimental analyses, including calibration and validation, sensitivity analysis, optimization, data assimilation, and multimodel integration. These analyses are typically run as workflows that coordinate ensembles of simulations on open science highperformance computing (HPC) systems, enabling large numbers of concurrent simulations. The execution model of these workflows is driven by the algorithmic logic, which can involve arbitrary loops and recursive behavior in pursuit of model convergence. In this paper, we describe a many resident task computing framework that coordinates ABM ensembles on open science HPC systems, driven by a pluggable, stateful optimization engine. While this model challenges conventional notions about workflows consisting of many run-to-completion tasks, we show how it enables rapid prototyping of many parameter search and optimization strategies. Our focus here is on ABMs and optimization, but these techniques are more widely applicable to any black-box scientific code and adaptive parameter space characterization. Ultimately, the goal is to democratize the use of HPC resources by enabling non-expert researchers to take advantage of the extreme scale systems that will be available in the next few years.

[1]  N. Gordon,et al.  Novel approach to nonlinear/non-Gaussian Bayesian state estimation , 1993 .

[2]  M. Beaumont Approximate Bayesian Computation in Evolution and Ecology , 2010 .

[3]  Daniel S. Katz,et al.  Toward Interlanguage Parallel Scripting for Distributed-Memory Scientific Computing , 2015, 2015 IEEE International Conference on Cluster Computing.

[4]  G. Evensen Data Assimilation: The Ensemble Kalman Filter , 2006 .

[5]  Daniel S. Katz,et al.  Swift/T: scalable data flow programming for many-task applications , 2013, PPoPP '13.

[6]  Joshua M. Epstein,et al.  Modelling to contain pandemics , 2009, Nature.

[7]  Michael J. North,et al.  Parallel agent-based simulation with Repast for High Performance Computing , 2013, Simul..

[8]  Richard J. Beckman,et al.  A Comparison of Three Methods for Selecting Values of Input Variables in the Analysis of Output From a Computer Code , 2000, Technometrics.

[9]  C. Macken,et al.  Mitigation strategies for pandemic influenza in the United States. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[10]  Neil J. Gordon,et al.  A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking , 2002, IEEE Trans. Signal Process..

[11]  Marc Parizeau,et al.  DEAP: evolutionary algorithms made easy , 2012, J. Mach. Learn. Res..

[12]  Winfried Kurth,et al.  Facilitating Parameter Estimation and Sensitivity Analysis of Agent-Based Models: A Cookbook Using NetLogo and 'R' , 2014, J. Artif. Soc. Soc. Simul..

[13]  Todd E. Combs,et al.  Global critical materials markets: An agent-based modeling approach , 2015 .

[14]  Charles M. Macal,et al.  Managing Business Complexity: Discovering Strategic Solutions with Agent-Based Modeling and Simulation , 2007 .

[15]  Florin Isaila,et al.  Exploiting data locality in Swift/T workflows using Hercules , 2014 .

[16]  Andreas Huth,et al.  Statistical inference for stochastic simulation models--theory and application. , 2011, Ecology letters.

[17]  Sasikiran Kandula,et al.  Inference and Forecast of the Current West African Ebola Outbreak in Guinea, Sierra Leone and Liberia , 2014, PLoS currents.

[18]  Joseph G. Pigeon,et al.  Statistics for Experimenters: Design, Innovation and Discovery , 2006, Technometrics.

[19]  Alicia Karspeck,et al.  Real-Time Influenza Forecasts during the 2012–2013 Season , 2013, Nature Communications.

[20]  I ScottKirkpatrick Optimization by Simulated Annealing: Quantitative Studies , 1984 .

[21]  D. Abramson,et al.  An Automatic Design Optimization Tool and its Application to Computational Fluid Dynamics , 2001, ACM/IEEE SC 2001 Conference (SC'01).

[22]  J. Shaman,et al.  Forecasting seasonal outbreaks of influenza , 2012, Proceedings of the National Academy of Sciences.

[23]  Alicia Karspeck,et al.  Comparison of Filtering Methods for the Modeling and Retrospective Forecasting of Influenza Epidemics , 2014, PLoS Comput. Biol..

[24]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[25]  Max D. Morris,et al.  Factorial sampling plans for preliminary computational experiments , 1991 .

[26]  Nicholson T. Collier,et al.  Modeling the transmission of community-associated methicillin-resistant Staphylococcus aureus: a dynamic agent-based simulation , 2014, Journal of Translational Medicine.

[27]  Jonathan Ozik,et al.  Simulating Water, Individuals, and Management using a coupled and distributed approach , 2014, Proceedings of the Winter Simulation Conference 2014.

[28]  Michael J. North,et al.  Simulating agricultural land rental markets by combining agent-based models with traditional economics concepts: The case of the Argentine Pampas , 2015, Environ. Model. Softw..

[29]  Michael J. North,et al.  Complex adaptive systems modeling with Repast Simphony , 2013, Complex Adapt. Syst. Model..