Towards Real Time Team Optimization

A team can often be viewed as a dynamic system whose configuration evolves over time (e.g., new members join the team, existing members leave, and members' skills improve). Consequently, the performance of the team may change as a result of these dynamics. A natural question is how to plan the (re-)staffing actions (e.g., recruiting a new team member) at each time step so as to maximize the expected cumulative performance of the team. In this paper, we address the problem of real-time team optimization by selecting, at each time step, the candidates that most increase the similarity between the current team and high-performance teams, given the current team configuration. The key idea is to formulate the problem as a Markov decision process (MDP) and leverage recent advances in reinforcement learning to optimize the team dynamically. The proposed method has two main advantages: (1) dynamics: it models the dynamics of the team and uses performance feedback to steer the initial team towards a high-performance team; (2) efficacy: it handles the large state/action space via value estimation based on deep reinforcement learning. We demonstrate the effectiveness of the proposed method through extensive empirical evaluations.
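To make the MDP view concrete, the following is a minimal sketch and not the method of the paper. It assumes a hypothetical pool of five people with made-up skill vectors (`SKILLS`), a made-up skill profile of a high-performance team (`TARGET`), and cosine similarity to that profile as the reward. The state is the current team, an action either keeps the team or swaps one member for an outside candidate, and tabular Q-learning stands in for the deep value estimation mentioned above, which is only feasible here because the toy state/action space is tiny.

```python
import random

# Hypothetical toy data (not from the paper): one skill vector per person.
SKILLS = {
    "a": (1, 0, 0),
    "b": (0, 1, 0),
    "c": (0, 0, 1),
    "d": (1, 1, 0),
    "e": (0, 1, 1),
}
TARGET = (1, 2, 1)  # assumed skill profile of a high-performance team
TEAM_SIZE = 2
HORIZON = 3         # re-staffing decisions per training episode


def cosine(u, v):
    dot = sum(x * y for x, y in zip(u, v))
    nu = sum(x * x for x in u) ** 0.5
    nv = sum(y * y for y in v) ** 0.5
    return dot / (nu * nv) if nu and nv else 0.0


def reward(team):
    """Similarity between the team's summed skills and the target profile."""
    total = tuple(sum(SKILLS[m][k] for m in team) for k in range(len(TARGET)))
    return cosine(total, TARGET)


def actions(team):
    """None keeps the team; (leave, join) swaps one member for a candidate."""
    outside = [p for p in SKILLS if p not in team]
    return [None] + [(leave, join) for leave in sorted(team) for join in outside]


def step(team, action):
    """Deterministic transition: apply the action, then observe the reward."""
    if action is not None:
        leave, join = action
        team = frozenset((team - {leave}) | {join})
    return team, reward(team)


def q_learning(episodes=5000, alpha=0.5, gamma=0.5, eps=0.5, seed=0):
    """Tabular Q-learning with random initial teams and epsilon-greedy moves."""
    rng = random.Random(seed)
    q = {}
    for _ in range(episodes):
        team = frozenset(rng.sample(sorted(SKILLS), TEAM_SIZE))
        for _ in range(HORIZON):
            acts = actions(team)
            if rng.random() < eps:
                a = rng.choice(acts)
            else:
                a = max(acts, key=lambda x: q.get((team, x), 0.0))
            nxt, r = step(team, a)
            best_next = max(q.get((nxt, b), 0.0) for b in actions(nxt))
            old = q.get((team, a), 0.0)
            q[(team, a)] = old + alpha * (r + gamma * best_next - old)
            team = nxt
    return q


def greedy_rollout(q, team, steps=HORIZON):
    """Follow the learned greedy policy from a given initial team."""
    for _ in range(steps):
        a = max(actions(team), key=lambda x: q.get((team, x), 0.0))
        team, _ = step(team, a)
    return team


if __name__ == "__main__":
    q_table = q_learning()
    final_team = greedy_rollout(q_table, frozenset({"a", "c"}))
    print(sorted(final_team), round(reward(final_team), 3))
```

In this toy setting the reward depends only on the resulting team, so the learned policy simply moves towards the team whose summed skills best match `TARGET` and then keeps it. A realistic setting would replace the Q-table with a learned value function, since the number of possible teams grows combinatorially with the candidate pool.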
