Semantics and Usage Statistics for Multi-dimensional Query Expansion

As the amount and complexity of data keep increasing in data warehouses, their exploration for analytical purposes may be hindered. Recommender systems have grown very popular on the Web with sites like Amazon, Netflix, etc. These systems proved successful to help users explore available content related to what they are currently looking at. Recent systems consider the use of recommendation techniques to suggest data warehouse queries and help an analyst pursue its exploration. In this paper, we present a personalized query expansion component which suggests measures and dimensions to iteratively build consistent queries over a data warehouse. Our approach leverages (a) semantics defined in multi-dimensional domain models, (b) collaborative usage statistics derived from existing repositories of Business Intelligence documents like dashboards and reports and (c) preferences defined in a user profile. We finally present results obtained with a prototype implementation of an interactive query designer.

[1]  Sunita Sarawagi,et al.  User-Adaptive Exploration of Multidimensional Data , 2000, VLDB.

[2]  Diego Calvanese,et al.  Discovering functional dependencies for multidimensional design , 2009, DOLAP.

[3]  Raymond J. Mooney,et al.  Content-boosted collaborative filtering for improved recommendations , 2002, AAAI/IAAI.

[4]  Laila Niedrite,et al.  OLAP Personalization with User-Describing Profiles , 2010, BIR.

[5]  Olivier Teste,et al.  Preference-Based Recommendations for OLAP Analysis , 2009, DaWaK.

[6]  Gediminas Adomavicius,et al.  Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions , 2005, IEEE Transactions on Knowledge and Data Engineering.

[7]  Patrick Marcel,et al.  A survey of query recommendation techniques for data warehouse exploration , 2011, EDA.

[8]  Arnaud Giacometti,et al.  Query recommendations for OLAP discovery driven analysis , 2009, DOLAP.

[9]  Arnaud Giacometti,et al.  A personalization framework for OLAP queries , 2005, DOLAP '05.

[10]  Arnaud Giacometti,et al.  A framework for recommending OLAP queries , 2008, DOLAP '08.

[11]  Martin Ester,et al.  TrustWalker: a random walk model for combining trust-based and item-based recommendation , 2009, KDD.

[12]  Peter Forbrig,et al.  Perspectives in Business Informatics Research - 9th International Conference, BIR 2010, Rostock Germany, September 29-October 1, 2010. Proceedings , 2010, BIR.

[13]  Taghi M. Khoshgoftaar,et al.  A Survey of Collaborative Filtering Techniques , 2009, Adv. Artif. Intell..

[14]  Mukesh K. Mohania,et al.  Enhanced Business Intelligence using EROCS , 2008, 2008 IEEE 24th International Conference on Data Engineering.

[15]  Matteo Golfarelli,et al.  myOLAP: An Approach to Express and Evaluate OLAP Preferences , 2011, IEEE Transactions on Knowledge and Data Engineering.