Multi-Objective Group Discovery on the Social Web

We are interested in discovering user groups from collabo-rative rating datasets of the form i, u, s, where i ∈ I, u ∈ U, and s is the integer rating that user u has assigned to item i. Each user has a set of attributes that help find labeled groups such as young computer scientists in France and American female designers. We formalize the problem of finding user groups whose quality is optimized in multiple dimensions and show that it is NP-Complete. We develop α-MOMRI, an α-approximation algorithm, and h-MOMRI, a heuristic-based algorithm , for multi-objective optimization to find high quality groups. Our extensive experiments on real datasets from the social Web examine the performance of our algorithms and report cases where α-MOMRI and h-MOMRI are useful.

[1]  GunopulosDimitrios,et al.  Automatic subspace clustering of high dimensional data for data mining applications , 1998 .

[2]  Mihalis Yannakakis,et al.  On the approximability of trade-offs and optimal access of Web sources , 2000, Proceedings 41st Annual Symposium on Foundations of Computer Science.

[3]  John Crawford,et al.  A multiobjective hybrid evolutionary algorithm for clustering in social networks , 2012, GECCO '12.

[4]  Howard J. Hamilton,et al.  Interestingness measures for data mining: A survey , 2006, CSUR.

[5]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[6]  Sumit Ganguly,et al.  Query optimization for parallel execution , 1992, SIGMOD '92.

[7]  Christoph F. Eick,et al.  A Framework for Multi-Objective Clustering and Its Application to Co-Location Mining , 2009, ADMA.

[8]  Christoph Koch,et al.  Approximation schemes for many-objective query optimization , 2014, SIGMOD Conference.

[9]  Panos M. Pardalos,et al.  Multilevel Optimization: Algorithms and Applications , 2012 .

[10]  Anil K. Jain,et al.  Multiobjective data clustering , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[11]  Daniel Kahneman,et al.  Probabilistic reasoning , 1993 .

[12]  Krzysztof Rzadca,et al.  Multi-Objective Scheduling , 2009, Introduction to Scheduling.

[13]  Andrei Z. Broder,et al.  Anatomy of the long tail: ordinary people with extraordinary tastes , 2010, WSDM '10.

[14]  Cong Yu,et al.  MRI: Meaningful Interpretations of Collaborative Ratings , 2011, Proc. VLDB Endow..

[15]  Christos D. Zaroliagis,et al.  Multiobjective Optimization: Improved FPTAS for Shortest Paths and Non-Linear Objectives with Applications , 2006, Theory of Computing Systems.

[16]  Aijun An,et al.  Efficient Bi-objective Team Formation in Social Networks , 2012, ECML/PKDD.