Active Learning for Input Space Exploration in Traffic Simulators

Urban environments are systems of overwhelming complexity and dynamism, involving numerous variables and idiosyncrasies which not usually easy to model from a functional perspective. Simulation modeling is a common and well-accepted approach to study such systems, specially those that prove to be too complex to be analyzed by standard analytic methods. However, such urban simulation models can become computationally very expensive to run. To address this drawback, simulation metamodels can be employed to approximate the underlying simulation function. In this paper, we propose a batch-mode active learning strategy based on Gaussian Processes metamodeling that searches for the most informative data points in batches with respect to their corresponding predictive variances. These points are selected in such a way that they originate from different high variance neighborhoods. Eventually, this allows us to analyze the simulation output behavior with fewer simulation requests. Using an illustrative traffic simulation example, the results show that the proposed restricted batch-mode strategy is able to increase the simulation input space exploration efficiency in comparison with standard batch-mode strategies.

[1]  Linda Weiser Friedman,et al.  The Simulation Metamodel , 1995 .

[2]  Wenbin Cai,et al.  Stability-Based Stopping Criterion for Active Learning , 2014, 2014 IEEE International Conference on Data Mining.

[3]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[4]  João Gama,et al.  A survey on concept drift adaptation , 2014, ACM Comput. Surv..

[5]  Alexis Boukouvalas,et al.  Emulation of random output simulators , 2010 .

[6]  Tao Chen,et al.  Efficient meta-modelling of complex process simulations with time-space-dependent outputs , 2011, Comput. Chem. Eng..

[7]  Andreas Vlachos,et al.  A stopping criterion for active learning , 2008, Computer Speech and Language.

[8]  Nasser M. Nasrabadi,et al.  Pattern Recognition and Machine Learning , 2006, Technometrics.

[9]  Burr Settles,et al.  Active Learning Literature Survey , 2009 .

[10]  Jack P. C. Kleijnen,et al.  A methodology for fitting and validating metamodels in simulation , 2000, Eur. J. Oper. Res..

[11]  Michal Wozniak,et al.  Active learning approach to concept drift problem , 2012, Log. J. IGPL.

[12]  Yi Zhang,et al.  Incorporating Diversity and Density in Active Learning for Relevance Feedback , 2007, ECIR.

[13]  Masood Ghayoomi,et al.  Using Variance as a Stopping Criterion for Active Learning of Frame Assignment , 2010, HLT-NAACL 2010.

[14]  Rachel T. Johnson,et al.  Design and analysis for the Gaussian process model , 2009, Qual. Reliab. Eng. Int..

[15]  B. Sansó,et al.  Advances in the Sequential Design of Computer Experiments Based on Active Learning , 2011 .

[16]  Jack P. C. Kleijnen,et al.  Kriging Metamodeling in Simulation: A Review , 2007, Eur. J. Oper. Res..

[17]  Rong Jin,et al.  Large-scale text categorization by batch mode active learning , 2006, WWW '06.

[18]  Jack P. C. Kleijnen,et al.  Regression metamodels for generalizing simulation results. , 1979 .

[19]  Geoff Holmes,et al.  Active Learning With Drifting Streaming Data , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[20]  Jack P. C. Kleijnen,et al.  Kriging interpolation in simulation: a survey , 2004, Proceedings of the 2004 Winter Simulation Conference, 2004..

[21]  J Bligh Learning from uncertainty: a change of culture , 2001, Medical education.

[22]  Zheng Zhu,et al.  Surrogate-Based Optimization for Solving a Mixed Integer Network Design Problem , 2015 .

[23]  A. O'Hagan,et al.  Bayesian emulation of complex multi-output and dynamic computer models , 2010 .

[24]  Cheong Hee Park,et al.  An active learning method for data streams with concept drift , 2016, 2016 IEEE International Conference on Big Data (Big Data).

[25]  Christopher K. I. Williams,et al.  Gaussian Processes for Machine Learning (Adaptive Computation and Machine Learning) , 2005 .

[26]  Daniel Krajzewicz,et al.  Recent Development and Applications of SUMO - Simulation of Urban MObility , 2012 .

[27]  Russell R. Barton,et al.  Simulation metamodels , 1998, 1998 Winter Simulation Conference. Proceedings (Cat. No.98CH36274).

[28]  Klaus Brinker,et al.  Incorporating Diversity in Active Learning with Support Vector Machines , 2003, ICML.

[29]  Hinrich Schütze,et al.  Stopping Criteria for Active Learning of Named Entity Recognition , 2008, COLING.

[30]  Jingbo Zhu,et al.  Confidence-based stopping criteria for active learning for data annotation , 2010, TSLP.

[31]  Chenfeng Xiong,et al.  Bayesian Stochastic Kriging Metamodel for Active Traffic Management of Corridors , 2014 .

[32]  Jack P. C. Kleijnen,et al.  Application-driven sequential designs for simulation experiments: Kriging metamodelling , 2004, J. Oper. Res. Soc..

[33]  Lei Zhang,et al.  Surrogate‐Based Optimization of Expensive‐to‐Evaluate Objective for Optimal Highway Toll Charges in Transportation Network , 2014, Comput. Aided Civ. Infrastructure Eng..

[34]  Rong Jin,et al.  Active Learning by Querying Informative and Representative Examples , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[35]  Steve Bankes,et al.  Exploratory Modeling for Policy Analysis , 1993, Oper. Res..

[36]  Lihong Li,et al.  Unbiased online active learning in data streams , 2011, KDD.

[37]  Geoff Holmes,et al.  Active Learning with Evolving Streaming Data , 2011, ECML/PKDD.

[38]  Terry L. Friesz,et al.  Statistical metamodeling of dynamic network loading , 2018, Transportation Research Part B: Methodological.

[39]  Tobias Pfingsten,et al.  Bayesian Active Learning for Sensitivity Analysis , 2006, ECML.

[40]  Marcello Montanino,et al.  Gaussian Process Metamodels for Sensitivity Analysis of Traffic Simulation Models , 2013 .