Dipe-D: A Tool for Knowledge-Based Query Formulation in Information Retrieval

The paper reports the development of Dipe-D, a knowledge-based procedure for the formulation of Boolean queries in information retrieval. Dipe-D creates a query in two steps: (1) the user's information need is developed interactively, while identifying the concepts of the information need, and subsequently (2) the collection of concepts identified is automatically transformed into a Boolean query. In the first step, the subject area—as represented in a knowledge base—is explored by the user. He does this by means of specifying the (concepts that meet his) information need in an artificial language and looking through the solution as provided by the computer. The specification language allows one to specify concepts by their features, both in precise terms as well as vaguely. By repeating the process of specifying the information need and exploring the resulting concepts, the user may precisely single out the concepts that describe his information need. In the second step, the program provides the designations (and variants) for the concepts identified, and connects them by appropriate operators. Dipe-D is meant to improve on existing procedures that identify the concepts less systematically, create a query manually, and then sometimes expand that query. Experiments are reported on each of the two steps; they indicate that the first step identifies only but not all the relevant concepts, and the second step performs (at least) as good as human beings do.

[1]  James Allan,et al.  Automatic Query Expansion Using SMART: TREC 3 , 1994, TREC.

[2]  W. Bruce Croft,et al.  Phrasal translation and query expansion techniques for cross-language information retrieval , 1997, SIGIR '97.

[3]  W. Bruce Croft,et al.  The INQUERY Retrieval System , 1992, DEXA.

[4]  Stephen E. Robertson,et al.  Interactive Thesaurus Navigation: Intelligence Rules OK? , 1995, J. Am. Soc. Inf. Sci..

[5]  Jaana Kekäläinen,et al.  The effects of query complexity, expansion and structure on retrieval performance in probabilistic text retrieval , 1999 .

[6]  Richard H. Fowler,et al.  Document Explorer Visualizations of WWW Document and Term Spaces , 1996 .

[7]  Amanda Spink,et al.  Real life information retrieval: a study of user queries on the Web , 1998, SIGF.

[8]  Roy Rada,et al.  A Graphical Thesaurus-Based Information Retrieval System , 1989, Int. J. Man Mach. Stud..

[9]  Edward A. Fox,et al.  Advanced feedback methods in information retrieval , 1985, J. Am. Soc. Inf. Sci..

[10]  Timo Niemi,et al.  A deductive data model for query expansion , 1996, SIGIR '96.

[11]  R. V. D. Pol Knowledge-based query formulation in information retrieval , 2000 .

[12]  F. J. Wiesman,et al.  Information retrieval by graphically browsing meta-information , 1998 .

[13]  Ruud W. van der Pol,et al.  Dipe-R: a knowledge representation language , 2003, Data Knowl. Eng..

[14]  Claude Chrisment,et al.  Querying a Hypertext Information Retrieval System by the Use of Classification , 1993, Inf. Process. Manag..

[15]  Daniel G. Shapiro,et al.  RUBRIC: A System for Rule-Based Information Retrieval , 1985, IEEE Transactions on Software Engineering.

[16]  Amanda Spink,et al.  Interaction in information retrieval: selection and effectiveness of search terms , 1997 .

[17]  Amanda Spink,et al.  Term relevance feedback and query expansion: relation to design , 1994, SIGIR '94.

[18]  M. E. Maron,et al.  An evaluation of retrieval effectiveness for a full-text document-retrieval system , 1985, CACM.

[19]  Chris Buckley,et al.  Improving automatic query expansion , 1998, SIGIR '98.

[20]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[21]  Matthias Hemmje,et al.  A 3D Based User Interface for Information Retrieval Systems , 1993, Workshop on Database Issues for Data Visualization.

[22]  Alan F. Smeaton,et al.  The Retrieval Effects of Query Expansion on a Feedback Document Retrieval System , 1983, Comput. J..

[23]  W. Bruce Croft,et al.  I 3 R: a new approach to the design of document retrieval systems , 1987 .

[24]  David A. Hull Using Structured Queries for Disambiguation in Cross-Language Information Retrieval , 1997 .

[25]  Jan L. G. Dietz,et al.  The ARCHIMEDES Network System: a system for searching and accessing information in multiple multimedia sources , 2004, Journal of Intelligent Information Systems.

[26]  Toshio Yokoi,et al.  The EDR electronic dictionary , 1995, CACM.

[27]  Florence Sèdes,et al.  Classification and Information retrieval in hypertext systems , 1991, RIAO.

[28]  Clement T. Yu,et al.  An Evaluation of Term Dependence Models in Information Retrieval , 1982, SIGIR.

[29]  J. J. Rocchio,et al.  Relevance feedback in information retrieval , 1971 .

[30]  Jaana Kristensen,et al.  Expanding End-Users' Query Statements for Free Text Searching with a Search-Aid Thesaurus , 1993, Inf. Process. Manag..

[31]  Luuk Matthijssen,et al.  Interfacing Between Lawyers and Computers: An Architecture for Knowledge-based Interfaces to Legal Databases , 1999 .

[32]  Peeter Kool,et al.  ER-based Information Retrieval in a Mixed Database Environment , 1993, ER.

[33]  Gerard Salton,et al.  Improving retrieval performance by relevance feedback , 1997, J. Am. Soc. Inf. Sci..

[34]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[35]  Jaana Kekäläinen,et al.  The impact of query structure and query expansion on retrieval performance , 1998, SIGIR '98.

[36]  Ellen M. Voorhees,et al.  Query expansion using lexical-semantic relations , 1994, SIGIR '94.

[37]  JonesSusan,et al.  Interactive thesaurus navigation , 1995 .

[38]  Jakub Zavrel Neural navigation interfaces for Information Retrieval: Are they more than an appealing idea? , 2004, Artificial Intelligence Review.