CPMetric: Deep Siamese Networks for Metric Learning on Structured Preferences

Preferences are central to decision making by both machines and humans. Representing, learning, and reasoning with preferences is an important area of study both within computer science and across the social sciences. When we give our preferences to an AI system we expect the system to make decisions or recommendations that are consistent with our preferences but the decisions should also adhere to certain norms, guidelines, and ethical principles. Hence, when working with preferences it is necessary to understand and compute a metric (distance) between preferences – especially if we encode both the user preferences and ethical systems in the same formalism. In this paper we investigate the use of CP-nets as a formalism for representing orderings over actions for AI systems. We leverage a recently proposed metric for CP-nets and propose a neural network architecture to learn an approximation of the metric, CPMetric. Using these two tools we look at how one can build a fast and flexible value alignment system (This is an expanded version of our paper, “Metric Learning for Value Alignment” [38]. In this version we have added the classification and regression results and significantly expanded the description of the CPMetric network.).

[1]  Athman Bouguettaya,et al.  Web Service Selection with Incomplete or Inconsistent User Preferences , 2009, ICSOC/ServiceWave.

[2]  Yann Chevaleyre,et al.  Learning Ordinal Preferences on Multiattribute Domains: The Case of CP-nets , 2010, Preference Learning.

[3]  Lirong Xia,et al.  Sequential composition of voting rules in multi-issue domains , 2009, Math. Soc. Sci..

[4]  Gabriella Pigozzi,et al.  Preferences in artificial intelligence , 2016, Annals of Mathematics and Artificial Intelligence.

[5]  Cao Xiao,et al.  FastGCN: Fast Learning with Graph Convolutional Networks via Importance Sampling , 2018, ICLR.

[6]  Nicholas Mattei,et al.  A behavioral perspective on social choice , 2012, Annals of Mathematics and Artificial Intelligence.

[7]  Ronen I. Brafman,et al.  CP-nets: A Tool for Representing and Reasoning withConditional Ceteris Paribus Preference Statements , 2011, J. Artif. Intell. Res..

[8]  Max Welling,et al.  Semi-Supervised Classification with Graph Convolutional Networks , 2016, ICLR.

[9]  Miroslaw Truszczynski,et al.  The computational complexity of dominance and consistency in CP-nets , 2005, IJCAI.

[10]  Minyi Li,et al.  An Efficient Algorithm To Compute Distance Between Lexicographic Preference Trees , 2018, IJCAI.

[11]  Eyke Hüllermeier,et al.  Preferences in AI: An overview , 2011, Artif. Intell..

[12]  Francesca Rossi,et al.  Building Ethically Bounded AI , 2018, AAAI.

[13]  Francesca Rossi,et al.  Incorporating Behavioral Constraints in Online AI Systems , 2018, AAAI.

[14]  Francesca Rossi,et al.  Preferences and Ethical Principles in Decision Making , 2018, AAAI Spring Symposia.

[15]  Xavier Bresson,et al.  Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering , 2016, NIPS.

[16]  Geoffrey I. Webb,et al.  Encyclopedia of Machine Learning , 2011, Encyclopedia of Machine Learning.

[17]  Yann LeCun,et al.  Learning a similarity metric discriminatively, with application to face verification , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[18]  Shani Alkoby,et al.  Teaching Social Behavior through Human Reinforcement for Ad hoc Teamwork - The STAR Framework: Extended Abstract , 2019, AAMAS.

[19]  Francesca Rossi,et al.  Beyond Theory and Data in Preference Modeling: Bringing Humans into the Loop , 2015, ADT.

[20]  Francesca Rossi,et al.  Reasoning with PCP-nets in a Multi-Agent Context , 2015, AAMAS.

[21]  Joan Bruna,et al.  Deep Convolutional Networks on Graph-Structured Data , 2015, ArXiv.

[22]  Michael I. Jordan,et al.  Distance Metric Learning with Application to Clustering with Side-Information , 2002, NIPS.

[23]  M. Kendall A NEW MEASURE OF RANK CORRELATION , 1938 .

[24]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[25]  Yann LeCun,et al.  Signature Verification Using A "Siamese" Time Delay Neural Network , 1993, Int. J. Pattern Recognit. Artif. Intell..

[26]  Francesca Rossi,et al.  Bribery in voting with CP-nets , 2013, Annals of Mathematics and Artificial Intelligence.

[27]  Alvin E. Roth,et al.  The Handbook of Experimental Economics, Volume 2 , 2016 .

[28]  Nicholas Mattei,et al.  Uniform Random Generation and Dominance Testing for CP-Nets , 2017, J. Artif. Intell. Res..

[29]  S.J.J. Smith,et al.  Empirical Methods for Artificial Intelligence , 1995 .

[30]  Kush R. Varshney,et al.  Teaching AI agents ethical values using reinforcement learning and policy orchestration , 2019, IBM J. Res. Dev..

[31]  Patrice Perny,et al.  GAI Networks for Utility Elicitation , 2004, KR.

[32]  Li Chen,et al.  Usability Guidelines for Product Recommenders Based on Example Critiquing Research , 2011, Recommender Systems Handbook.

[33]  Toby Walsh,et al.  A Short Introduction to Preferences: Between AI and Social Choice , 2011 .

[34]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[35]  C. Allen,et al.  Moral Machines: Teaching Robots Right from Wrong , 2008 .

[36]  Palash Goyal,et al.  Graph Embedding Techniques, Applications, and Performance: A Survey , 2017, Knowl. Based Syst..

[37]  Ronald Fagin,et al.  Comparing Partial Rankings , 2006, SIAM J. Discret. Math..

[38]  Subbarao Kambhampati Synthesizing Explainable Behavior for Human-AI Collaboration , 2019, AAMAS.

[39]  Francesca Rossi,et al.  Preferences and Ethical Priorities: Thinking Fast and Slow in AI , 2019, AAMAS.

[40]  Nicholas Mattei,et al.  Generating CP-Nets Uniformly at Random , 2016, AAAI.

[41]  Ronen I. Brafman,et al.  Extended Semantics and Optimization Algorithms for CP‐Networks , 2004, Comput. Intell..

[42]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[43]  Henri Prade,et al.  Graphical Models for Preference Representation: An Overview , 2016, SUM.

[44]  Marc Sebban,et al.  A Survey on Metric Learning for Feature Vectors and Structured Data , 2013, ArXiv.

[45]  Yoshua Bengio,et al.  Convolutional networks for images, speech, and time series , 1998 .

[46]  Mark D. McDonnell,et al.  Understanding Data Augmentation for Classification: When to Warp? , 2016, 2016 International Conference on Digital Image Computing: Techniques and Applications (DICTA).

[47]  Joan Bruna,et al.  Spectral Networks and Locally Connected Networks on Graphs , 2013, ICLR.

[48]  Eyke Hllermeier,et al.  Preference Learning , 2010 .

[49]  Judy Goldsmith,et al.  Preference Handling for Artificial Intelligence , 2008, AI Mag..

[50]  Francesca Rossi,et al.  On the Distance Between CP-nets , 2018, AAMAS.

[51]  Kihyuk Sohn,et al.  Improved Deep Metric Learning with Multi-class N-pair Loss Objective , 2016, NIPS.

[52]  Athman Bouguettaya,et al.  A CP-Net Based Qualitative Composition Approach for an IaaS Provider , 2018, WISE.

[53]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[54]  S. P. Lloyd,et al.  Least squares quantization in PCM , 1982, IEEE Trans. Inf. Theory.

[55]  C. Allen,et al.  Artificial Morality: Top-down, Bottom-up, and Hybrid Approaches , 2005, Ethics and Information Technology.

[56]  Pascal Frossard,et al.  The emerging field of signal processing on graphs: Extending high-dimensional data analysis to networks and other irregular domains , 2012, IEEE Signal Processing Magazine.

[57]  Francesca Rossi,et al.  Updates and Uncertainty in CP-Nets , 2013, Australasian Conference on Artificial Intelligence.

[58]  Francesca Rossi,et al.  Semiring-based constraint satisfaction and optimization , 1997, JACM.

[59]  Francesca Rossi,et al.  Using Contextual Bandits with Behavioral Constraints for Constrained Online Movie Recommendation , 2018, IJCAI.

[60]  Vincent Conitzer,et al.  Handbook of Computational Social Choice , 2016 .

[61]  Stuart J. Russell,et al.  Research Priorities for Robust and Beneficial Artificial Intelligence , 2015, AI Mag..