POMDPs for Assisting Homeless Shelters - Computational and Deployment Challenges

This paper looks at challenges faced during the ongoing deployment of HEALER, a POMDP based software agent that recommends sequential intervention plans for use by homeless shelters, who organize these interventions to raise awareness about HIV among homeless youth. HEALER’s sequential plans (built using knowledge of social networks of homeless youth) choose intervention participants strategically to maximize influence spread, while reasoning about uncertainties in the network. In order to compute its plans, HEALER (i) casts this influence maximization problem as a POMDP and solves it using a novel planner which scales up to previously unsolvable real-world sizes; (ii) and constructs social networks of homeless youth at low cost, using a Facebook application. HEALER is currently being deployed in the real world in collaboration with a homeless shelter. Initial feedback from the shelter officials has been positive but they were surprised by the solutions generated by HEALER as these solutions are very counter-intuitive. Therefore, there is a need to justify HEALER’s solutions in a way that mirrors the officials’ intuition. In this paper, we report on progress made towards HEALER’s deployment and detail first steps taken to tackle the issue of explaining HEALER’s solutions.

[1]  E. Rice The Positive Role of Social Networks and Social Networking Technology in the Condom-Using Behaviors of Homeless Young People , 2010, Public health reports.

[2]  George Karypis,et al.  Multi-threaded Graph Partitioning , 2013, 2013 IEEE 27th International Symposium on Parallel and Distributed Processing.

[3]  Susanne Biundo-Stephan,et al.  Making Hybrid Plans More Clear to Human Users - A Formal Approach for Generating Sound Explanations , 2012, ICAPS.

[4]  Xiaokui Xiao,et al.  Influence maximization: near-optimal time complexity meets practical efficiency , 2014, SIGMOD Conference.

[5]  Edith Cohen,et al.  Sketch-based Influence Maximization and Computation: Scaling up with Guarantees , 2014, CIKM.

[6]  Joel Veness,et al.  Monte-Carlo Planning in Large POMDPs , 2010, NIPS.

[7]  Pascal Poupart,et al.  Minimal Sufficient Explanations for Factored Markov Decision Processes , 2009, ICAPS.

[8]  Joelle Pineau,et al.  Online Planning Algorithms for POMDPs , 2008, J. Artif. Intell. Res..

[9]  Leandro Soriano Marcolino,et al.  Simultaneous Influencing and Mapping Social Networks: (Extended Abstract) , 2016, AAMAS.

[10]  E. Laumann,et al.  A new HIV prevention network approach: sociometric peer change agent selection. , 2015, Social science & medicine.

[11]  Jure Leskovec,et al.  The Network Completion Problem: Inferring Missing Nodes and Edges in Networks , 2011, SDM.

[12]  Leen-Kiat Soh,et al.  To Ask, Sense, or Share: Ad Hoc Information Gathering , 2015, AAMAS.

[13]  Eric Rice,et al.  Online Social Networking Technologies, HIV Knowledge, and Sexual Risk and Testing Behaviors Among Homeless Youth , 2010, AIDS and Behavior.

[14]  T. Valente,et al.  Identifying Opinion Leaders to Promote Behavior Change , 2007, Health education & behavior : the official publication of the Society for Public Health Education.

[15]  Kee-Eung Kim,et al.  Closing the Gap: Improved Bounds on Optimal POMDP Solutions , 2011, ICAPS.

[16]  N. Milburn,et al.  Position-specific HIV risk in a large network of homeless youths. , 2012, American journal of public health.

[17]  E. Rice,et al.  Sexuality and homelessness in Los Angeles public schools. , 2012, American journal of public health.

[18]  Andreas Krause,et al.  Adaptive Submodularity: Theory and Applications in Active Learning and Stochastic Optimization , 2010, J. Artif. Intell. Res..

[19]  Andreas Krause,et al.  Cost-effective outbreak detection in networks , 2007, KDD '07.

[20]  R. Winett,et al.  Randomised, controlled, community-level HIV-prevention intervention for sexual-risk behaviour among homosexual men in US cities , 1997, The Lancet.

[21]  N. Milburn,et al.  Mobilizing homeless youth for HIV prevention: a social network analysis of the acceptability of a face-to-face and online social networking intervention. , 2012, Health education research.

[22]  Martin L. Puterman,et al.  Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .

[23]  Dongqing Yang,et al.  Influence Maximizing and Local Influenced Community Detection Based on Multiple Spread Model , 2011, ADMA.

[24]  Christian Borgs,et al.  Maximizing Social Influence in Nearly Optimal Time , 2012, SODA.

[25]  Leandro Soriano Marcolino,et al.  Preventing HIV Spread in Homeless Populations Using PSINET , 2015, AAAI.

[26]  Camille Roth,et al.  How Realistic Should Knowledge Diffusion Models Be? , 2007, J. Artif. Soc. Soc. Simul..

[27]  Haifeng Xu,et al.  Using Social Networks to Aid Homeless Shelters: Dynamic Influence Maximization under Uncertainty , 2016, AAMAS.

[28]  Éva Tardos,et al.  Maximizing the Spread of Influence through a Social Network , 2015, Theory Comput..

[29]  Brahim Chaib-draa,et al.  An online POMDP algorithm for complex multiagent environments , 2005, AAMAS '05.

[30]  Nikos A. Vlassis,et al.  Perseus: Randomized Point-based Value Iteration for POMDPs , 2005, J. Artif. Intell. Res..