Paper Information - Relational Boosted Bandits

Relational Boosted Bandits

Contextual bandit algorithms have become essential in real-world user interaction problems in recent years. However, these algorithms represent context as attribute-value pairs, which makes them infeasible for real-world domains such as social networks, which are inherently relational. We propose Relational Boosted Bandits (RB2), a contextual bandit algorithm for relational domains based on (relational) boosted trees. RB2 enables us to learn interpretable and explainable models due to the more descriptive nature of the relational representation. We empirically demonstrate the effectiveness and interpretability of RB2 on tasks such as link prediction, relational classification, and recommendation.

Balaraman Ravindran | Sriraam Natarajan | Ashutosh Kakadiya
