Query recommendation in the information domain of children

Children represent an increasing group of web users. Some of the key problems that hamper their search experience is their limited vocabulary, their difficulty in using the right keywords, and the inappropriateness of their general‐purpose query suggestions. In this work, we propose a method that uses tags from social media to suggest queries related to children's topics. Concretely, we propose a simple yet effective approach to bias a random walk defined on a bipartite graph of web resources and tags through keywords that are more commonly used to describe resources for children. We evaluate our method using a large query log sample of queries submitted by children. We show that our method outperforms by a large margin the query suggestions of modern search engines and state‐of‐the art query suggestions based on random walks. We improve further the quality of the ranking by combining the score of the random walk with topical and language modeling features to emphasize even more the child‐related aspects of the query suggestions.

[1]  Djoerd Hiemstra,et al.  Query log analysis in the context of information retrieval for children , 2010, SIGIR '10.

[2]  Leif Azzopardi,et al.  MaSe: create your own mash-up search interface , 2012, SIGIR '12.

[3]  Leif Azzopardi,et al.  YooSee: a video browsing application for young children , 2012, SIGIR '12.

[4]  Taher H. Haveliwala Topic-sensitive PageRank , 2002, IEEE Trans. Knowl. Data Eng..

[5]  Ophir Frieder,et al.  Query Phrase Suggestion from Topically Tagged Session Logs , 2006, FQAS.

[6]  Yiqun Liu,et al.  Automatically generating labels based on unified click model , 2011, WWW.

[7]  Dania Bilal,et al.  Children's use of the Yahooligans! Web search engine. III. Cognitive and physical behaviors on fully self-generated search tasks , 2002, J. Assoc. Inf. Sci. Technol..

[8]  Marie-Francine Moens,et al.  Clash of the Typings - Finding Controversies and Children's Topics Within Queries , 2011, ECIR.

[9]  Nick Craswell,et al.  Random walks on the click graph , 2007, SIGIR.

[10]  Marie-Francine Moens,et al.  Wisdom of the ages: toward delivering the children's web with the link-based agerank algorithm , 2010, CIKM.

[11]  Wolfgang Nejdl,et al.  Utility analysis for topically biased PageRank , 2007, WWW '07.

[12]  Djoerd Hiemstra,et al.  Query recommendation for children , 2012, CIKM '12.

[13]  Steven C. H. Hoi,et al.  A two-view learning approach for image tag ranking , 2011, WSDM '11.

[14]  Dong Liu,et al.  Tag ranking , 2009, WWW '09.

[15]  Roi Blanco,et al.  Probabilistic static pruning of inverted files , 2010, TOIS.

[16]  Taher H. Haveliwala Topic-Sensitive PageRank: A Context-Sensitive Ranking Algorithm for Web Search , 2003, IEEE Trans. Knowl. Data Eng..

[17]  Gregory N. Hullender,et al.  Learning to rank using gradient descent , 2005, ICML.

[18]  Ingmar Weber,et al.  What and how children search on the web , 2011, CIKM '11.

[19]  Hector Garcia-Molina,et al.  Combating Web Spam with TrustRank , 2004, VLDB.

[20]  C. Bauckhage,et al.  Analyzing Social Bookmarking Systems : A del . icio . us Cookbook , 2008 .

[21]  Ben He,et al.  Sponsored Search Ad Selection by Keyword Structure Analysis , 2013, ECIR.

[22]  Nick Craswell,et al.  An experimental comparison of click position-bias models , 2008, WSDM '08.

[23]  Lidong Bing,et al.  Using query log and social tagging to refine queries based on latent topics , 2011, CIKM '11.

[24]  ChengXiang Zhai,et al.  Mining term association patterns from search logs for effective query reformulation , 2008, CIKM '08.

[25]  Ricardo A. Baeza-Yates,et al.  Extracting semantic relations from query logs , 2007, KDD '07.

[26]  Xianchao Zhang,et al.  Automatic seed set expansion for trust propagation based anti-spamming algorithms , 2009, WIDM.

[27]  Brian D. Davison,et al.  Explorations in tag suggestion and query expansion , 2008, SSM '08.

[28]  Ariel Fuxman,et al.  Using the wisdom of the crowds for keyword generation , 2008, WWW.

[29]  Tony Abou-Assaleh,et al.  A link-based ranking scheme for focused search , 2007, WWW '07.

[30]  Feng Qiu,et al.  Automatic identification of user interest for personalized search , 2006, WWW '06.

[31]  Efthimis N. Efthimiadis,et al.  Analyzing and evaluating query reformulation strategies in web search logs , 2009, CIKM.

[32]  Congyan Lang,et al.  Towards relevance and saliency ranking of image tags , 2012, ACM Multimedia.

[33]  Elizabeth Foss,et al.  How children search the internet with keyword interfaces , 2009, IDC.

[34]  Satoshi Nakamura,et al.  Can social bookmarking enhance search in the web? , 2007, JCDL '07.

[35]  Francesco Bonchi,et al.  Query suggestions using query-flow graphs , 2009, WSCD '09.

[36]  Baoning Wu,et al.  Extracting link spam using biased random walks from spam seed sets , 2007, AIRWeb '07.

[37]  Tim van de Cruys Two Multivariate Generalizations of Pointwise Mutual Information , 2011, Proceedings of the Workshop on Distributional Semantics and Compositionality.

[38]  Wei Gao,et al.  Exploiting query logs for cross-lingual query suggestions , 2010, TOIS.

[39]  Hongfei Lin,et al.  Selecting related terms in query-logs using two-stage SimRank , 2011, CIKM '11.

[40]  Haojie Li,et al.  Tag ranking by propagating relevance over tag and image graphs , 2012, ICIMCS '12.

[41]  Marcel Worring,et al.  Learning tag relevance by neighbor voting for social image retrieval , 2008, MIR '08.

[42]  Arjen P. de Vries,et al.  Web page classification on child suitability , 2010, CIKM.

[43]  Ahmed Hassan Awadallah,et al.  Beyond DCG: user behavior as a predictor of a successful search , 2010, WSDM '10.

[44]  Aristides Gionis,et al.  Improving recommendation for long-tail queries via templates , 2011, WWW.

[45]  Shuicheng Yan,et al.  Image tag refinement towards low-rank, content-tag prior and error sparsity , 2010, ACM Multimedia.

[46]  Kenneth Ward Church,et al.  Query suggestion using hitting time , 2008, CIKM '08.

[47]  Andrei Z. Broder,et al.  Automatic generation of bid phrases for online advertising , 2010, WSDM '10.

[48]  BilalDania Children's use of the Yahooligans! Web search engine , 2001 .

[49]  Marie-Francine Moens,et al.  EmSe: Supporting Children's Information Needs within a Hospital Environment , 2012, ECIR.