Author-topic profiles for academic search

We implemented and evaluated a two-stage retrieval method for personalized academic search in which the initial search results are re-ranked using an author-topic profile. In academic search tasks, the user's own data can help optimizing the ranking of search results to match the searcher's specific individual needs. The author-topic profile consists of topic-specific terms, stored in a graph. We re-rank the top-1000 retrieved documents using ten features that represent the similarity between the document and the author-topic graph. We found that the re-ranking gives a small but significant improvement over the reproduced best method from the literature. Storing the profile as a graph has a number of advantages: it is flexible with respect to node and relation types; it is a visualization of knowledge that is interpretable by the user, and it offers the possibility to view relational characteristics of individual nodes.

[1]  Oren Kurland,et al.  Re-ranking search results using document-passage graphs , 2008, SIGIR '08.

[2]  Laurence Anthony F. Park,et al.  Score adjustment for correction of pooling bias , 2009, SIGIR.

[3]  Muhammad Ali Norozi,et al.  Contextualization from the Bibliographic Structure , 2012 .

[4]  Birger Larsen,et al.  Exploiting Citation Contexts for Physics Retrieval , 2015, BIR@ECIR.

[5]  Wessel Kraaij,et al.  Query Term Suggestion in Academic Search , 2014, ECIR.

[6]  Jie Tang,et al.  A Combination Approach to Web User Profiling , 2010, TKDD.

[7]  Alexander Pretschner,et al.  Ontology based personalized search , 1999, Proceedings 11th International Conference on Tools with Artificial Intelligence.

[8]  Christina Lioma,et al.  Preliminary study of technical terminology for the retrieval of scientific book metadata records , 2012, SIGIR '12.

[9]  Peter Ingwersen,et al.  Developing a Test Collection for the Evaluation of Integrated Search , 2010, ECIR.

[10]  Gilles Louppe,et al.  Gradient Boosted Regression Trees in Scikit-Learn , 2014 .

[11]  Bradley M. Hemminger,et al.  Information seeking behavior of academic scientists , 2007, J. Assoc. Inf. Sci. Technol..

[12]  Mohand Boughanem,et al.  Towards a graph-based user profile modeling for a session-based personalized search , 2009, Knowledge and Information Systems.

[13]  ChengXiang Zhai,et al.  Implicit user modeling for personalized search , 2005, CIKM '05.

[14]  Anthony Jameson,et al.  User Modeling and User-Adapted Interaction , 2004, User Modeling and User-Adapted Interaction.

[15]  Tamar Sadeh Optimizing Relevance Ranking to Enhance the User's Discovery Experience , 2013, IRCDL.

[16]  Christina Lioma,et al.  Preliminary experiments using subjective logic for the polyrepresentation of information needs , 2012, IIiX.

[17]  Katia P. Sycara,et al.  WebMate: a personal agent for browsing and searching , 1998, AGENTS '98.

[18]  Alessandro Micarelli,et al.  Anatomy and Empirical Evaluation of an Adaptive Web-Based Information Filtering System , 2004, User Modeling and User-Adapted Interaction.

[19]  Toine Bogers,et al.  An Exploration of Retrieval-Enhancing Methods for Integrated Search in a Digital Library , 2012 .

[20]  Matthew Hurst,et al.  A Language Model Approach to Keyphrase Extraction , 2003, ACL 2003.

[21]  Helen Ashman,et al.  Examining Personalization in Academic Web Search , 2015, HT.

[22]  Rada Mihalcea,et al.  TextRank: Bringing Order into Text , 2004, EMNLP.

[23]  NgWilfred,et al.  Personalized Concept-Based Clustering of Search Engine Queries , 2008 .

[24]  Vincent P. Wade,et al.  Personalised Information Retrieval: survey and classification , 2013, User Modeling and User-Adapted Interaction.

[25]  Kenneth Wai-Ting Leung,et al.  Personalized Concept-Based Clustering of Search Engine Queries , 2008, IEEE Transactions on Knowledge and Data Engineering.

[26]  Wessel Kraaij,et al.  Evaluation and analysis of term scoring methods for term extraction , 2016, Information Retrieval Journal.

[27]  Xiaohua Hu,et al.  Language Model Document Priors based on Citation and Co-citation Analysis , 2014, BIR@ECIR.

[28]  Francisco Tanudjaja,et al.  Persona: a contextualized and personalized web search , 2002, Proceedings of the 35th Annual Hawaii International Conference on System Sciences.

[29]  Yiming Yang,et al.  RCV1: A New Benchmark Collection for Text Categorization Research , 2004, J. Mach. Learn. Res..

[30]  Christina Lioma,et al.  Sense discrimination for physics retrieval , 2011, SIGIR '11.

[31]  W. Bruce Croft,et al.  Automatic boolean query suggestion for professional search , 2011, SIGIR.

[32]  Dong Zhou,et al.  Improving search via personalized query expansion using social media , 2012, Information Retrieval.

[33]  Susan Gauch,et al.  Personalizing Search Based on User Search Histories , 2004 .