User Interests Clustering in Business Intelligence Interactions

It is quite common these days for experts, casual analysts, executives or data enthusiasts, to analyze large datasets using user-friendly interfaces on top of Business Intelligence (BI) systems. However, current BI systems do not adequately detect and characterize user interests, which may lead to tedious and unproductive interactions. In this paper, we propose to identify such user interests by characterizing the intent of the interaction with the BI system. With an eye on user modeling for proactive search systems, we identify a set of features for an adequate description of intents, and a similarity measure for grouping intents into coherent interests. We validate experimentally our approach with a user study, where we analyze traces of BI navigation. We show that our similarity measure outperforms a state-of-the-art query similarity measure and yields a very good precision with respect to expressed user interests.

[1]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[2]  Matteo Golfarelli,et al.  Mining Preferences from OLAP Query Logs for Proactive Personalization , 2011, ADBIS.

[3]  Wei Chu,et al.  Learning to extract cross-session search tasks , 2013, WWW.

[4]  Yang Song,et al.  Query-Less: Predicting Task Repetition for NextGen Proactive Search and Recommendation Engines , 2016, WWW.

[5]  Esteban Zimányi,et al.  Data Warehouse Systems , 2014, Data-Centric Systems and Applications.

[6]  Ali S. Hadi,et al.  Finding Groups in Data: An Introduction to Chster Analysis , 1991 .

[7]  Ramanathan V. Guha,et al.  User Modeling for a Personal Assistant , 2015, WSDM.

[8]  Nicholas Jing Yuan,et al.  Contextual Intent Tracking for Personal Assistants , 2016, KDD.

[9]  Milad Shokouhi,et al.  Modeling User Interests for Zero-Query Ranking , 2016, ECIR.

[10]  Klemens Böhm,et al.  Identifying User Interests within the Data Space - a Case Study with SkyServer , 2015, EDBT.

[11]  Peter J. Rousseeuw,et al.  Clustering by means of medoids , 1987 .

[12]  Matteo Golfarelli,et al.  Similarity measures for OLAP sessions , 2013, Knowledge and Information Systems.

[13]  Matteo Golfarelli,et al.  A collaborative filtering approach for recommending OLAP sessions , 2015, Decis. Support Syst..

[14]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[15]  Bamshad Mobasher,et al.  Data Mining for Web Personalization , 2007, The Adaptive Web.