CiteSight: supporting contextual citation recommendation using differential search

A person often uses a single search engine for very different tasks. For example, an author editing a manuscript may use the same academic search engine to find the latest work on a particular topic or to find the correct citation for a familiar article. The author's tolerance for latency and accuracy may vary according to task. However, search engines typically employ a consistent approach for processing all queries. In this paper we explore how a range of search needs and expectations can be supported within a single search system using differential search. We introduce CiteSight, a system that provides personalized citation recommendations to author groups that vary based on task. CiteSight presents cached recommendations instantaneously for online tasks (e.g., active paper writing), and refines these recommendations in the background for offline tasks (e.g., future literature review). We develop an active cache-warming process to enhance the system as the author works, and context-coupling, a technique for augment sparse citation networks. By evaluating the quality of the recommendations and collecting user feedback, we show that differential search can provide a high level of accuracy for different tasks on different time scales. We believe that differential search can be used in many situations where the user's tolerance for latency and desired response vary dramatically based on use.

[1]  C McFarlaneDaniel,et al.  The scope and importance of human interruption in human-computer interaction design , 2002 .

[2]  Lada A. Adamic,et al.  Friends and neighbors on the Web , 2003, Soc. Networks.

[3]  W. Bruce Croft,et al.  Recommending citations for academic papers , 2007, SIGIR.

[4]  Gary Marchionini,et al.  Exploratory search and HCI: designing and evaluating interfaces to support exploratory search interaction , 2007, CHI Extended Abstracts.

[5]  Howard D. White,et al.  Authors as citers over time , 2001, J. Assoc. Inf. Sci. Technol..

[6]  Wenyi Huang,et al.  Recommending citations: translating papers into references , 2012, CIKM.

[7]  Hongfei Yan,et al.  Recommending citations with translation model , 2011, CIKM '11.

[8]  Jaime Teevan,et al.  Information re-retrieval: repeat queries in Yahoo's logs , 2007, SIGIR.

[9]  Nigel Harwood,et al.  Publication outlets and their effect on academic writers’ citations , 2008, Scientometrics.

[10]  Jasmine Novak,et al.  Building enriched document representations using aggregated anchor text , 2009, SIGIR.

[11]  Desney S. Tan,et al.  CueTIP: a mixed-initiative interface for correcting handwriting errors , 2006, UIST.

[12]  Sean M. McNee,et al.  Enhancing digital libraries with TechLens , 2004, Proceedings of the 2004 Joint ACM/IEEE Conference on Digital Libraries, 2004..

[13]  Jian-Yun Nie,et al.  Position-Aligned Translation Model for Citation Recommendation , 2012, SPIRE.

[14]  Hongyuan Zha,et al.  A General Boosting Method and its Application to Learning Ranking Functions for Web Search , 2007, NIPS.

[15]  Eric Horvitz,et al.  Principles of mixed-initiative user interfaces , 1999, CHI '99.

[16]  Prasenjit Mitra,et al.  Utilizing Context in Generative Bayesian Models for Linked Corpus , 2010, AAAI.

[17]  S. Baldi Normative versus social constructivist processes in the allocation of citations : A network-analytic model , 1998 .

[18]  Daniel Jurafsky,et al.  Who should I cite: learning literature search models from citation behavior , 2010, CIKM.

[19]  Lutz Bornmann,et al.  What do citation counts measure? A review of studies on citing behavior , 2008, J. Documentation.

[20]  Susan T. Dumais,et al.  Personalizing Search via Automated Analysis of Interests and Activities , 2005, SIGIR.

[21]  Ümit V. Çatalyürek,et al.  Diversified recommendation on graphs: pitfalls, measures, and algorithms , 2013, WWW.

[22]  Mark E. J. Newman,et al.  Power-Law Distributions in Empirical Data , 2007, SIAM Rev..

[23]  Jöran Beel,et al.  Scienstein : A Research Paper Recommender System , 2009 .

[24]  Roberto I. González-Ibáñez,et al.  Exploring information seeking processes in collaborative search tasks , 2010, ASIST.

[25]  Thad Starner,et al.  Remembrance Agent: A Continuously Running Automated Information Retrieval System , 1996, PAAM.

[26]  Wei Chu,et al.  Modeling the impact of short- and long-term behavior on search personalization , 2012, SIGIR '12.

[27]  Ryen W. White,et al.  Slow Search: Information Retrieval without Time Constraints , 2013, HCIR '13.

[28]  Robert B. Miller,et al.  Response time in man-computer conversational transactions , 1899, AFIPS Fall Joint Computing Conference.

[29]  Daniel Kifer,et al.  Context-aware citation recommendation , 2010, WWW '10.

[30]  Jian Pei,et al.  Citation recommendation without author supervision , 2011, WSDM '11.

[31]  Sean M. McNee,et al.  On the recommending of citations for research papers , 2002, CSCW '02.

[32]  Jie Tang,et al.  A Discriminative Approach to Topic-Based Citation Recommendation , 2009, PAKDD.

[33]  Kristian J. Hammond,et al.  Watson: Anticipating and Contextualizing Information Needs , 1999 .

[34]  Gary Marchionini,et al.  Exploratory search , 2006, Commun. ACM.

[35]  Susan T. Dumais,et al.  Large scale analysis of web revisitation patterns , 2008, CHI.

[36]  Kara A. Latorella,et al.  The Scope and Importance of Human Interruption in Human-Computer Interaction Design , 2002, Hum. Comput. Interact..

[37]  Sean M. McNee,et al.  Enhancing digital libraries with TechLens+ , 2004, JCDL.

[38]  Xiaolong Zhang,et al.  CiteSense: supporting sensemaking of research literature , 2008, CHI.

[39]  Susan T. Dumais,et al.  Implicit queries (IQ) for contextualized search , 2004, SIGIR '04.

[40]  Meredith Ringel Morris,et al.  Discovering and using groups to improve personalized search , 2009, WSDM '09.

[41]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[42]  Ramesh Nallapati,et al.  Joint latent topic models for text and citations , 2008, KDD.

[43]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[44]  Ryen W. White,et al.  Exploratory Search , 2008 .

[45]  J. Lafferty,et al.  Mixed-membership models of scientific publications , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[46]  Chirag Shah,et al.  Algorithmic mediation for collaborative exploratory search , 2008, SIGIR '08.