Using program synthesis for social recommendations

This paper presents a new approach to select events of interest to users in a social media setting where events are generated from mobile devices. We argue that the problem is best solved by inductive learning, where the goal is to first generalize from the users' expressed "likes" and "dislikes" of specific events, then to produce a program that can be used to collect only data of interest. The key contribution of this paper is a new algorithm that combines machine learning techniques with program synthesis technology to learn users' preferences. We show that when compared with the more standard approaches, our new algorithm provides up to order-of-magnitude reductions in model training time, and significantly higher prediction accuracies for our target application.1

[1]  Nir Friedman,et al.  Probabilistic Graphical Models - Principles and Techniques , 2009 .

[2]  K. I. Ramachandran,et al.  Feature selection using Decision Tree and classification through Proximal Support Vector Machine for fault diagnostics of roller bearing , 2007 .

[3]  Yehuda Koren,et al.  Factor in the neighbors: Scalable and accurate collaborative filtering , 2010, TKDD.

[4]  Nitesh V. Chawla,et al.  Editorial: special issue on learning from imbalanced data sets , 2004, SKDD.

[5]  Sumit Gulwani,et al.  Automating string processing in spreadsheets using input-output examples , 2011, POPL '11.

[6]  Daphne Koller,et al.  Support Vector Machine Active Learning with Applications to Text Classification , 2000, J. Mach. Learn. Res..

[7]  Sachin Garg,et al.  Response prediction using collaborative filtering with hierarchies and side-information , 2011, KDD.

[8]  Sanjit A. Seshia,et al.  Combinatorial sketching for finite programs , 2006, ASPLOS XII.

[9]  Daphne Koller,et al.  Support Vector Machine Active Learning with Application sto Text Classification , 2000, ICML.

[10]  Brian D. Davison,et al.  A probabilistic model for personalized tag prediction , 2010, KDD.

[11]  Armando Solar-Lezama,et al.  Sketching concurrent data structures , 2008, PLDI '08.

[12]  Sumit Gulwani,et al.  Path-based inductive synthesis for program inversion , 2011, PLDI '11.

[13]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[14]  Hui Xiong,et al.  Enhancing recommender systems under volatile userinterest drifts , 2009, CIKM.

[15]  Nils J. Nilsson,et al.  Probabilistic Logic * , 2022 .

[16]  Ben Taskar,et al.  Introduction to Statistical Relational Learning (Adaptive Computation and Machine Learning) , 2007 .

[17]  Lei Tang,et al.  Large-scale behavioral targeting with a social twist , 2011, CIKM '11.

[18]  Jure Leskovec,et al.  Predicting positive and negative links in online social networks , 2010, WWW '10.

[19]  Ee-Peng Lim,et al.  Trust network inference for online rating data using generative models , 2010, KDD.

[20]  Rishabh Singh,et al.  Synthesizing data structure manipulations from storyboards , 2011, ESEC/FSE '11.

[21]  Sumit Gulwani,et al.  Oracle-guided component-based program synthesis , 2010, 2010 ACM/IEEE 32nd International Conference on Software Engineering.

[22]  Alvin Cheung,et al.  Demo: Automatically generating interesting events with LifeJoin , 2011, SenSys.

[23]  Yehuda Koren,et al.  Modeling relationships at multiple scales to improve accuracy of large recommender systems , 2007, KDD '07.

[24]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .

[25]  J. Ross Quinlan,et al.  Learning logical definitions from relations , 1990, Machine Learning.

[26]  Ben Taskar,et al.  Introduction to statistical relational learning , 2007 .

[27]  Lin Zhong,et al.  Self-constructive high-rate system energy modeling for battery-powered mobile systems , 2011, MobiSys '11.

[28]  Stephen Muggleton,et al.  Inverse entailment and progol , 1995, New Generation Computing.

[29]  R. Schapire The Strength of Weak Learnability , 1990, Machine Learning.