A Knowledge-Based Approach to Organizing Retrieved Documents

When people use computer-based tools to find answers to general questions, they are often faced with a daunting list of search results, or "hits," returned by the search engine. Many search tools address this problem by helping users make their searches more specific. However, when dozens or hundreds of documents are relevant to a question, users need tools that help them explore and understand their search results, rather than tools that eliminate a portion of those results. In this paper, we present DynaCat, a tool that dynamically categorizes search results into a hierarchical organization by using knowledge of important kinds of queries and a model of the domain terminology. Results from our evaluation show that DynaCat helps users find answers to those important types of questions more quickly and easily than a relevance-ranking system or a clustering system does.
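To make the idea of knowledge-based categorization concrete, the following is a minimal sketch, not DynaCat's actual implementation. It assumes each document already carries a set of domain terms, and that two small lookup tables stand in for the "model of the domain terminology" and the "knowledge of important kinds of queries." All names here (`TERMINOLOGY`, `QUERY_TYPES`, `ancestor_path`, `categorize`) and all sample data are hypothetical and invented for illustration.

```python
from collections import defaultdict

# Hypothetical toy terminology model: term -> (semantic type, parent term or None).
TERMINOLOGY = {
    "tamoxifen": ("drug", "hormone therapy"),
    "hormone therapy": ("drug", None),
    "mastectomy": ("procedure", "surgery"),
    "surgery": ("procedure", None),
    "nausea": ("symptom", None),
}

# Hypothetical query model: query type -> semantic types worth categorizing by.
QUERY_TYPES = {
    "treatment-options": {"drug", "procedure"},
    "adverse-effects": {"symptom"},
}


def ancestor_path(term):
    """Return the path from the top-level term down to `term`."""
    path = []
    while term is not None:
        path.append(term)
        term = TERMINOLOGY[term][1]
    return tuple(reversed(path))


def categorize(documents, query_type):
    """Group document titles under hierarchical category paths.

    `documents` maps a title to the set of domain terms assigned to it.
    A document may land in several categories, or in none.
    """
    relevant_types = QUERY_TYPES[query_type]
    categories = defaultdict(list)
    for title, terms in documents.items():
        for term in sorted(terms):
            if term in TERMINOLOGY and TERMINOLOGY[term][0] in relevant_types:
                categories[ancestor_path(term)].append(title)
    return dict(categories)


# Invented sample search results for a "treatment-options" style query.
docs = {
    "Doc A": {"tamoxifen", "nausea"},
    "Doc B": {"mastectomy"},
    "Doc C": {"tamoxifen"},
}
result = categorize(docs, "treatment-options")
for path, titles in sorted(result.items()):
    print(" > ".join(path), "->", titles)
```

In this sketch the query type decides which kinds of terms become categories, and the terminology's parent links supply the hierarchy, so the same result set would be organized differently for an "adverse-effects" query. No document is discarded by the grouping step itself; documents without a relevant term simply appear in no category.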
