Behavioral analytics for myopic agents

Many multi-agent systems have the structure of a single coordinator providing behavioral or financial incentives to a large number of agents. Two challenges faced by the coordinator are a finite budget from which to allocate incentives, and an initial lack of knowledge about the utility function of the agents. Here, we present a behavioral analytics approach to solve the coordinator's problem when agents make decisions by maximizing utility functions that depend on prior system states, inputs, and other parameters that are initially unknown and subject to temporal dynamics. Our behavioral analytics framework involves three steps: first, we develop a behavioral model that describes the decision-making process of an agent; second, we use data to estimate behavioral model parameters for each agent and then use these estimates to predict future decisions of each agent; and third, we use the estimated behavioral model parameters to optimize a set of costly incentives to provide to each agent. In this paper, we describe a specific set of tools, models, and approaches that fit into this framework, and that adapt models and incentives as new information is collected by repeating the second and third steps of this framework. Furthermore, we prove that incentives computed by this adaptive approach are asymptotically optimal with respect to a given loss function that describes the coordinator's objective. We optimize incentives using a decomposition scheme, where each sub-problem solves the coordinator's problem for a single agent, and the master problem is a pure integer program. We conclude with a simulation study to evaluate the effectiveness of our behavioral analytics approach in designing personalized treatment plans for a weight loss program. The results show our approach maintains efficacy of the program while reducing costs by up to 60%, while adaptive heuristics provide less savings.

[1]  Turgay Ayer,et al.  Heterogeneity in Women's Adherence and Its Role in Optimal Breast Cancer Screening Policies , 2016, Manag. Sci..

[2]  Donna Spruijt-Metz,et al.  Current mHealth technologies for physical activity assessment and promotion. , 2013, American journal of preventive medicine.

[3]  Henrik Ohlsson,et al.  Incentive Design and Utility Learning via Energy Disaggregation , 2013, 1312.1394.

[4]  Paul H. Rubin The Theory of the Firm and the Structure of the Franchise Contract , 1978, Journal law and economy.

[5]  Jiming Chen,et al.  A Survey on Demand Response in Smart Grids: Mathematical Models and Approaches , 2015, IEEE Transactions on Industrial Informatics.

[6]  Peter I. Frazier,et al.  Bayesian optimization for materials design , 2015, 1506.01349.

[7]  Vishal Gupta,et al.  Data-driven estimation in equilibrium using inverse optimization , 2013, Mathematical Programming.

[8]  Xu Andy Sun,et al.  Adaptive Robust Optimization With Dynamic Uncertainty Sets for Multi-Period Economic Dispatch Under Significant Wind , 2015 .

[9]  H. Vincent Poor,et al.  Scheduling Power Consumption With Price Uncertainty , 2011, IEEE Transactions on Smart Grid.

[10]  Sarah C. Darby,et al.  Smart metering: what potential for householder engagement? , 2010 .

[11]  Vincent W. S. Wong,et al.  Advanced Demand Side Management for the Future Smart Grid Using Mechanism Design , 2012, IEEE Transactions on Smart Grid.

[12]  Martin W. P. Savelsbergh,et al.  Stratified patient appointment scheduling for mobile community-based chronic disease management programs , 2016 .

[13]  Luciano Messori The Theory of Incentives I: The Principal-Agent Model , 2013 .

[14]  Anil Aswani,et al.  Behavioral modeling in weight loss interventions , 2019, Eur. J. Oper. Res..

[15]  Karl Johan Åström,et al.  Adaptive Control , 1989, Embedded Digital Control with Microcontrollers.

[16]  E. Gal‐Or Maintaining Quality Standards in Franchise Chains , 1995 .

[17]  Anil Aswani,et al.  Duality Approach to Bilevel Programs with a Convex Lower Level , 2016, 2018 Annual American Control Conference (ACC).

[18]  S. Shankar Sastry,et al.  Provably safe and robust learning-based model predictive control , 2011, Autom..

[19]  Vincent W. S. Wong,et al.  Optimal Real-Time Pricing Algorithm Based on Utility Maximization for Smart Grid , 2010, 2010 First IEEE International Conference on Smart Grid Communications.

[20]  J. Blanchet,et al.  A markov chain approximation to choice modeling , 2013, EC '13.

[21]  Zuo-Jun Max Shen,et al.  Inverse Optimization with Noisy Data , 2015, Oper. Res..

[22]  Michal Kaut,et al.  Evaluation of scenario-generation methods for stochastic programming , 2007 .

[23]  Devin Mann,et al.  Evidence-based strategies in weight-loss mobile apps. , 2013, American journal of preventive medicine.

[24]  B. Pasik-Duncan,et al.  Adaptive Control , 1996, IEEE Control Systems.

[25]  G. Kariniotakis,et al.  A Stochastic Dynamic Programming Model for Optimal Use of Local Energy Resources in a Market Environment , 2007, 2007 IEEE Lausanne Power Tech.

[26]  Peter W. Glynn,et al.  Dynamics on Linear Influence Network Games Under Stochastic Environments , 2016, GameSec.

[27]  Warren B. Powell,et al.  An adaptive-learning framework for semi-cooperative multi-agent coordination , 2011, 2011 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL).

[28]  Karl Aberer,et al.  Effective consumption scheduling for demand-side management in the smart grid using non-uniform participation rate , 2013, 2013 Sustainable Internet and ICT for Sustainability (SustainIT).

[29]  Mukund Sundararajan,et al.  Mean Field Equilibria of Dynamic Auctions with Learning , 2014, Manag. Sci..

[30]  M. Mifflin,et al.  A new predictive equation for resting energy expenditure in healthy individuals. , 1990, The American journal of clinical nutrition.

[31]  D. Fudenberg,et al.  Digitized by the Internet Archive in 2011 with Funding from Working Paper Department of Economics the Folk Theorem with Imperfect Public Information , 2022 .

[32]  Petr Lachout,et al.  On continuous convergence and epi-convergence of random functions. Part II: Sufficient conditions and applications , 2003, Kybernetika.

[33]  R. Lal Improving Channel Coordination Through Franchising , 1990 .

[34]  C. Fox,et al.  Breast-cancer screening. , 1979, Lancet.

[35]  Claire J. Tomlin,et al.  Monotone Piecewise Affine Systems , 2009, IEEE Transactions on Automatic Control.

[36]  Stephen P. Boyd,et al.  Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[37]  Mariel S. Lavieri,et al.  Dynamic Forecasting and Control Algorithms of Glaucoma Progression for Clinician Decision Support , 2015, Oper. Res..

[38]  Mohsen Bayati,et al.  Online Decision-Making with High-Dimensional Covariates , 2015 .

[39]  Jonathan F. Bard,et al.  The Mixed Integer Linear Bilevel Programming Problem , 1990, Oper. Res..

[40]  Cynthia Rudin,et al.  The Big Data Newsvendor: Practical Insights from Machine Learning , 2013, Oper. Res..

[41]  Stephen P. Boyd,et al.  Imputing a convex objective function , 2011, 2011 IEEE International Symposium on Intelligent Control.

[42]  Kevin L. Joiner,et al.  A Novel Diabetes Prevention Intervention Using a Mobile App: A Randomized Controlled Trial With Overweight Adults at Risk. , 2015, American journal of preventive medicine.

[43]  Firooz B. Saghezchi,et al.  Game-theoretic based scheduling for demand-side management in 5G smart grids , 2015, 2015 IEEE Symposium on Computers and Communication (ISCC).

[44]  Benjamin Van Roy,et al.  Deep Exploration via Bootstrapped DQN , 2016, NIPS.

[45]  Gerard J. M. Smit,et al.  Management and Control of Domestic Smart Grid Technology , 2010, IEEE Transactions on Smart Grid.

[46]  Pinar Keskinocak,et al.  OR Practice - Catch-Up Scheduling for Childhood Vaccination , 2009, Oper. Res..

[47]  Brian T. Denton,et al.  Using Electronic Health Records to Monitor and Improve Adherence to Medication , 2013 .

[48]  Susan X. Li,et al.  Transaction-efficiency analysis of franchising arrangements through chance cross-constrained game theory , 1997, Comput. Oper. Res..

[49]  Diwakar Gupta,et al.  Adaptive Appointment Systems with Patient Preferences , 2011, Manuf. Serv. Oper. Manag..

[50]  Benjamin Van Roy,et al.  Bootstrapped Thompson Sampling and Deep Exploration , 2015, ArXiv.

[51]  S. Shankar Sastry,et al.  Adaptive Control of Mechanical Manipulators , 1987 .

[52]  Chih-Ping Chou,et al.  Technology-facilitated depression care management among predominantly Latino diabetes patients within a public safety net care system: comparative effectiveness trial design. , 2014, Contemporary clinical trials.

[53]  Leo Breiman,et al.  Statistical Modeling: The Two Cultures (with comments and a rejoinder by the author) , 2001 .

[54]  M. Roizen,et al.  10-year follow-up of diabetes incidence and weight loss in the Diabetes Prevention Program Outcomes Study , 2010 .

[55]  Petr Lachout,et al.  On continuous convergence and epi-convergence of random functions. Part I: Theory and relations , 2003, Kybernetika.

[56]  Ramesh Johari,et al.  Mean Field Equilibrium in Dynamic Games with Strategic Complementarities , 2013, Oper. Res..

[57]  Turgay Ayer,et al.  OR Forum - A POMDP Approach to Personalize Mammography Screening Decisions , 2012, Oper. Res..

[58]  C. Desoer,et al.  Linear System Theory , 1963 .

[59]  Laurence A. Wolsey,et al.  Integer and Combinatorial Optimization , 1988 .

[60]  Margaret L. Brandeau,et al.  Dynamic Learning of Patient Response Types: An Application to Treating Chronic Diseases , 2017, Manag. Sci..

[61]  Na Li,et al.  Optimal demand response based on utility maximization in power networks , 2011, 2011 IEEE Power and Energy Society General Meeting.

[62]  Andrew E. B. Lim,et al.  Decentralized Control of a Stochastic Multi-Agent Queueing System , 2012, IEEE Transactions on Automatic Control.

[63]  Peter Palensky,et al.  Demand Side Management: Demand Response, Intelligent Energy Systems, and Smart Loads , 2011, IEEE Transactions on Industrial Informatics.

[64]  Yoshimi Fukuoka,et al.  The mPED randomized controlled clinical trial: applying mobile persuasive technologies to increase physical activity in sedentary women protocol , 2011, BMC public health.

[65]  S. Wartman,et al.  When competent patients make irrational choices. , 1990, The New England journal of medicine.

[66]  David M Nathan,et al.  10-year follow-up of diabetes incidence and weight loss in the Diabetes Prevention Program Outcomes Study. , 2009, Lancet.

[67]  J. Bard,et al.  An algorithm for the discrete bilevel programming problem , 1992 .

[68]  Vedat Verter,et al.  An Analytical Framework for Designing Community‐Based Care for Chronic Diseases , 2011 .

[69]  R. Radner Repeated Principal-Agent Games with Discounting , 1985 .

[70]  Robert R. Bitmead,et al.  Persistence of excitation conditions and the convergence of adaptive schemes , 1984, IEEE Trans. Inf. Theory.

[71]  A. Bandura Health promotion from the perspective of social cognitive theory , 1998 .

[72]  Ramesh Johari,et al.  Information Aggregation and Allocative Efficiency in Smooth Markets , 2010 .

[73]  Allison O'Hair Personalized diabetes management , 2013 .

[74]  Hamed Mohsenian Rad,et al.  Optimal Residential Load Control With Price Prediction in Real-Time Electricity Pricing Environments , 2010, IEEE Transactions on Smart Grid.

[75]  Marco Levorato,et al.  Residential Demand Response Using Reinforcement Learning , 2010, 2010 First IEEE International Conference on Smart Grid Communications.

[76]  Diwakar Gupta,et al.  Revenue Management for a Primary-Care Clinic in the Presence of Patient Choice , 2008, Oper. Res..

[77]  J. Cawley An economic framework for understanding physical activity and eating behaviors. , 2004, American journal of preventive medicine.

[78]  T. Ralphs,et al.  On the Value Function of a Mixed Integer Linear Optimization Problem and an Algorithm for its Construction , 2014 .

[79]  Gah‐Yi Vahn The Data-Driven (s, S) Policy: Why You Can Have Confidence in Censored Demand Data , 2016 .

[80]  Stephen C. Graves,et al.  An LP Planning Model for a Mental Health Community Support System , 2011 .

[81]  Zhibin Jiang,et al.  Optimizing product service system by franchise fee contracts under information asymmetry , 2016, Ann. Oper. Res..

[82]  Dimitris Bertsimas,et al.  On the power and limitations of affine policies in two-stage adaptive optimization , 2012, Math. Program..

[83]  M. Morari,et al.  Stability and stabilization of piecewise affine and hybrid systems: an LMI approach , 2000, Proceedings of the 39th IEEE Conference on Decision and Control (Cat. No.00CH37187).

[84]  Lora E Burke,et al.  Mobile applications for weight management: theory-based content analysis. , 2013, American journal of preventive medicine.

[85]  Ravindra K. Ahuja,et al.  Inverse Optimization , 2001, Oper. Res..

[86]  M. Conner,et al.  Predicting health behaviour : research and practice with social cognition models , 2005 .

[87]  Stephan Dempe,et al.  Foundations of Bilevel Programming , 2002 .

[88]  Steven H. Low,et al.  Real-time demand response with uncertain renewable energy in smart grid , 2011, 2011 49th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[89]  Sarang Deo,et al.  Improving Health Outcomes Through Better Capacity Allocation in a Community-Based Chronic Care Model , 2013, Oper. Res..

[90]  Nan Liu,et al.  Dynamic Scheduling of Outpatient Appointments Under Patient No-Shows and Cancellations , 2010, Manuf. Serv. Oper. Manag..

[91]  Clemens Heuberger,et al.  Inverse Combinatorial Optimization: A Survey on Problems, Methods, and Results , 2004, J. Comb. Optim..