Learning Nonsymmetric Determinantal Point Processes

Determinantal point processes (DPPs) have attracted substantial attention as an elegant probabilistic model that captures the balance between quality and diversity within sets. DPPs are conventionally parameterized by a positive semi-definite kernel matrix, and this symmetric kernel encodes only repulsive interactions between items. These so-called symmetric DPPs have significant expressive power, and have been successfully applied to a variety of machine learning tasks, including recommendation systems, information retrieval, and automatic summarization, among many others. Efficient algorithms for learning symmetric DPPs and sampling from these models have been reasonably well studied. However, relatively little attention has been given to nonsymmetric DPPs, which relax the symmetric constraint on the kernel. Nonsymmetric DPPs allow for both repulsive and attractive item interactions, which can significantly improve modeling power, resulting in a model that may better fit for some applications. We present a method that enables a tractable algorithm, based on maximum likelihood estimation, for learning nonsymmetric DPPs from data composed of observed subsets. Our method imposes a particular decomposition of the nonsymmetric kernel that enables such tractable learning algorithms, which we analyze both theoretically and experimentally. We evaluate our model on synthetic and real-world datasets, demonstrating improved predictive performance compared to symmetric DPPs, which have previously shown strong performance on modeling tasks associated with these datasets.

[1]  Jennifer Gillenwater Approximate inference for determinantal point processes , 2014 .

[2]  Nima Anari,et al.  Monte Carlo Markov Chain Algorithms for Sampling Strongly Rayleigh Distributions and Determinantal Point Processes , 2016, COLT.

[3]  Hui Lin,et al.  Learning Mixtures of Submodular Shells with Application to Document Summarization , 2012, UAI.

[4]  Amin Karbasi,et al.  Fast Mixing for Discrete Point Processes , 2015, COLT.

[5]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[6]  Victor-Emmanuel Brunel,et al.  Learning Signed Determinantal Point Processes through the Principal Minor Assignment Problem , 2018, NeurIPS.

[7]  J. Møller,et al.  Determinantal point process models and statistical inference , 2012, 1205.4818.

[8]  Ben Taskar,et al.  Learning the Parameters of Determinantal Point Process Kernels , 2014, ICML.

[9]  Yifan Hu,et al.  Collaborative Filtering for Implicit Feedback Datasets , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[10]  Andreas Krause,et al.  Near-Optimal Sensor Placements in Gaussian Processes: Theory, Efficient Algorithms and Empirical Studies , 2008, J. Mach. Learn. Res..

[11]  Suvrit Sra,et al.  Fast DPP Sampling for Nystrom with Application to Kernel Methods , 2016, ICML.

[12]  ChengXiang Zhai,et al.  Improving one-class collaborative filtering by incorporating rich user information , 2010, CIKM.

[13]  Ben Taskar,et al.  Determinantal Point Processes for Machine Learning , 2012, Found. Trends Mach. Learn..

[14]  Suvrit Sra,et al.  Kronecker Determinantal Point Processes , 2016, NIPS.

[15]  Suvrit Sra,et al.  Fixed-point algorithms for learning determinantal point processes , 2015, ICML.

[16]  Ben Taskar,et al.  Learning Determinantal Point Processes , 2011, UAI.

[17]  T. Liggett,et al.  Negative dependence and the geometry of polynomials , 2007, 0707.2340.

[18]  Kun Guo,et al.  Data mining for the online retail industry: A case study of RFM model-based customer segmentation using data mining , 2012 .

[19]  Kristen Grauman,et al.  Large-Margin Determinantal Point Processes , 2014, UAI.

[20]  Ulrich Paquet,et al.  Low-Rank Factorization of Determinantal Point Processes , 2017, AAAI.

[21]  Ankur Moitra,et al.  Rates of estimation for determinantal point processes , 2017, COLT.

[22]  Michael J. Tsatsomeros,et al.  Generating and detecting matrices with positive principal minors , 2004 .

[23]  Alexei Borodin,et al.  Determinantal point processes , 2009, 0911.1153.

[24]  Ben Taskar,et al.  Expectation-Maximization for Learning Determinantal Point Processes , 2014, NIPS.

[25]  Ulrich Paquet,et al.  Bayesian Low-Rank Determinantal Point Processes , 2016, RecSys.

[26]  Francis R. Bach,et al.  Learning Determinantal Point Processes in Sublinear Time , 2016, AISTATS.

[27]  Ben Taskar,et al.  k-DPPs: Fixed-Size Determinantal Point Processes , 2011, ICML.

[28]  Hedvig Kjellström,et al.  Stochastic Learning on Imbalanced Data: Determinantal Point Processes for Mini-batch Diversification , 2017, ArXiv.