Navigating information spaces: A case study of related article search in PubMed

The concept of an ''information space'' provides a powerful metaphor for guiding the design of interactive retrieval systems. We present a case study of related article search, a browsing tool designed to help users navigate the information space defined by results of the PubMed^(R) search engine. This feature leverages content-similarity links that tie MEDLINE^(R) citations together in a vast document network. We examine the effectiveness of related article search from two perspectives: a topological analysis of networks generated from information needs represented in the TREC 2005 genomics track and a query log analysis of real PubMed users. Together, data suggest that related article search is a useful feature and that browsing related articles has become an integral part of how users interact with PubMed.

[1]  Alan M. Frieze,et al.  Random graphs , 2006, SODA '06.

[2]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[3]  S. Robertson The probability ranking principle in IR , 1997 .

[4]  Alexander Dekhtyar,et al.  Information Retrieval , 2018, Lecture Notes in Computer Science.

[5]  Marti A. Hearst,et al.  TREC 2007 Genomics Track Overview , 2007, TREC.

[6]  James Allan,et al.  Measuring the Navigability of Document Networks , 2007 .

[7]  C. J. van Rijsbergen,et al.  The use of hierarchic clustering in information retrieval , 1971, Inf. Storage Retr..

[8]  Peter Pirolli,et al.  Information Foraging , 2009, Encyclopedia of Database Systems.

[9]  Marti A. Hearst Clustering versus faceted categories for information exploration , 2006, Commun. ACM.

[10]  Paul P. Maglio,et al.  The conceptual structure of information space , 2003 .

[11]  Hsinchun Chen,et al.  Medical Informatics: Knowledge Management and Data Mining in Biomedicine (Operations Research/Computer Science Interfaces) , 2005 .

[12]  Marti A. Hearst,et al.  Reexamining the cluster hypothesis: scatter/gather on retrieval results , 1996, SIGIR '96.

[13]  Gerard Salton,et al.  The SMART Retrieval System—Experiments in Automatic Document Processing , 1971 .

[14]  Albert-László Barabási,et al.  Statistical mechanics of complex networks , 2001, ArXiv.

[15]  Donna K. Harman,et al.  The TREC Test Collections , 2005 .

[16]  Gobinda G. Chowdhury,et al.  TREC: Experiment and Evaluation in Information Retrieval , 2007 .

[17]  W. John Wilbur,et al.  The Effectiveness of Document Neighboring in Search Enhancement , 1994, Inf. Process. Manag..

[18]  George W. Furnas,et al.  Effective view navigation , 1997, CHI.

[19]  Ellen M. Vdorhees The cluster hypothesis revisited , 1985, SIGIR 1985.

[20]  Jimmy J. Lin,et al.  PubMed related articles: a probabilistic topic-based model for content similarity , 2007, BMC Bioinformatics.

[21]  Wei-Ying Ma,et al.  Query Expansion by Mining User Logs , 2003, IEEE Trans. Knowl. Data Eng..

[22]  Susan T. Dumais,et al.  Optimizing search by showing results in context , 2001, CHI.

[23]  W. Pratt,et al.  The usefulness of dynamically categorizing search results. , 2000, Journal of the American Medical Informatics Association : JAMIA.

[24]  Fernando Diaz,et al.  Regularizing query-based retrieval scores , 2007, Information Retrieval.

[25]  James Allan,et al.  Strategy-based interactive cluster visualization for information retrieval , 2000, International Journal on Digital Libraries.

[26]  Ellen M. Vdorhees,et al.  The cluster hypothesis revisited , 1985, SIGIR '85.

[27]  W. John Wilbur,et al.  Modeling Text Retrieval in Biomedicine , 2005 .

[28]  James Allan,et al.  Find-similar: similarity browsing as a search tool , 2006, SIGIR.

[29]  W. Bruce Croft,et al.  Cluster-based retrieval using language models , 2004, SIGIR '04.

[30]  Ellen M. Voorhees,et al.  The fourteenth text retrieval conference TREC 2005 , 2006 .

[31]  Ben Shneiderman,et al.  Balancing Systematic and Flexible Exploration of Social Networks , 2006, IEEE Transactions on Visualization and Computer Graphics.

[32]  José Luis Vicedo González,et al.  TREC: Experiment and evaluation in information retrieval , 2007, J. Assoc. Inf. Sci. Technol..

[33]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[34]  Kristina Höök,et al.  Designing Information Spaces: The Social Navigation Approach , 2003, Computer Supported Cooperative Work.

[35]  Jimmy J. Lin,et al.  Answer Extraction, Semantic Clustering, and Extractive Summarization for Clinical Question Answering , 2006, ACL.

[36]  Paul P. Maglio,et al.  The Conceptual Structure of Information Space , 2003, Designing Information Spaces.