Automated Assistance in the Formulation of Search Statements for Bibliographic Databases

Abstract We report on the design and construction of features of an automated query system which will assist pharmacologists who are not information specialists to access the Derwent Drug File (DDF) pharmacological database. Our approach was to first elucidate those search skills of the search intermediary which might prove tractable to automation. Modules were then produced which assist in the three important subtasks of search statement generation, namely vocabulary selection, the choice of context indicators and query reformulation. Vocabulary selection is facilitated by approximate string matching, morphological analysis, browsing and menu searching. The context of the study, such as treatment or metabolism, is determined using a system of advisory menus. The task of query reformulation is performed using user feedback on retrieved documents, thesaurus relations between document index terms and term postings data. Use is made of diverse information sources, including electronic forms of printed search aids, a thesaurus and a medical dictionary. The system will be of use both to semicasual users and experienced intermediaries. Many of the ideas developed should prove transportable to domains other than pharmacology: the techniques for thesaurus manipulation are designed for use with any hierarchical thesaurus.

[1]  Helen M. Brooks,et al.  Expert systems and intelligent information retrieval , 1987, Inf. Process. Manag..

[2]  Yves Chiaramella,et al.  A prototype of an intelligent system for information retrieval: IOTA , 1987, Inf. Process. Manag..

[3]  Peretz Shoval,et al.  Principles, procedures and rules in an expert system for information retrieval , 1985, Inf. Process. Manag..

[4]  Dietmar Wolfram,et al.  Searcher response in a hypertext-based bibliographic information retrieval system , 1995 .

[5]  Chris D. Paice Method for Evaluation of Stemming Algorithms Based on Error Counting , 1996, J. Am. Soc. Inf. Sci..

[6]  R. P. Rodgers Automated Retrieval from Multiple Disparate Information Sources: The World Wide Web and the NLM's Sourcerer Project , 1995, J. Am. Soc. Inf. Sci..

[7]  George McMurdo How the Internet was indexed , 1995, J. Inf. Sci..

[8]  G. Philip Use of 'leading-edge' information systems by academic chemists in the UK: part II. Constraints and the need for usability engineering , 1996, J. Inf. Sci..

[9]  Richard V. Janke,et al.  Online in Canada. , 1981 .

[10]  Geoffrey P. Ellis,et al.  HIBROWSE for bibliographic databases , 1994, J. Inf. Sci..

[11]  Paul R. Cohen,et al.  Information retrieval by constrained spreading activation in semantic networks , 1987, Inf. Process. Manag..

[12]  Colin G. Drury,et al.  EARS: An online bibliographic search and retrieval system based on ordered explosion , 1987, Inf. Process. Manag..

[13]  Donald T. Hawkins,et al.  Online Bibliographic Search Strategy Development. , 1982 .

[14]  Colin H. Davidson,et al.  Improved Design of Graphic displays in Thesauri - through Technology and Ergonomics , 1986, J. Documentation.

[15]  Jean-Pierre Chevallet,et al.  About Retrieval Models and Logic , 1992, Comput. J..

[16]  Tschera Harkness Connell Subject searching in online catalogs: metaknowledge used by experienced searchers , 1995 .

[17]  C. J. van Rijsbergen,et al.  Interactive querying techniques for an office filing facility , 1986, Inf. Process. Manag..

[18]  H L Bleich,et al.  PaperChase: a computer program to search the medical literature. , 1981, The New England journal of medicine.

[19]  Richard Fikes,et al.  The role of frame-based representation in reasoning , 1985, CACM.

[20]  Ray R. Larson Evaluation of advanced retrieval techniques in an experimental online catalog , 1992 .

[21]  Helen M. Brooks,et al.  Plexus-the expert system for referral , 1987, Inf. Process. Manag..

[22]  Ross Wilkinson Using Combination of Evidence for Term Expansion , 1997, BCS-IRSG Annual Colloquium on IR Research.

[23]  Nicholas V. Findler,et al.  SHRIF, A General-Purpose System for Heuristic Retreival of Information and Facts, Applied to Medical Knowledge Processing , 1992, Inf. Process. Manag..

[24]  John Davies,et al.  User Profiling Techniques: A Critical Review , 1997, BCS-IRSG Annual Colloquium on IR Research.

[25]  A. Steven Pollitt,et al.  CANSEARCH: An expert systems approach to document retrieval , 1987, Inf. Process. Manag..

[26]  Louise T. Su The Relevance of Recall and Precision in User Evaluation , 1994, J. Am. Soc. Inf. Sci..

[27]  Anne B. Piternick Searching vocabularies: a developing category of online search tools , 1984 .

[28]  Thomas J. Froehlich,et al.  Relevance reconsidered—towards an agenda for the 21st century: introduction to special topic issue on relevance research , 1994 .

[29]  Hsinchun Chen,et al.  Knowledge-based document retrieval: framework and design , 1992, J. Inf. Sci..

[30]  Arnold Rochfeld,et al.  Relationship of relationships and other inter-relationship links in E-R model , 1992, Data Knowl. Eng..

[31]  Michael A. Shepherd,et al.  PSI: a portable self‐contained intermediary for access to bibliographic database systems , 1984 .

[32]  Susan Jones A thesaurus data model for an intelligent retrieval system , 1993, J. Inf. Sci..

[33]  Graham A Stephen,et al.  Approximate String Matching , 1994, Encyclopedia of Algorithms.

[34]  Robert N. Oddy,et al.  INFORMATION RETRIEVAL THROUGH MAN‐MACHINE DIALOGUE , 1977 .

[35]  W. Bruce Croft Approaches to Intelligent Information Retrieval , 1987, Inf. Process. Manag..

[36]  L. R. Dice Measures of the Amount of Ecologic Association Between Species , 1945 .

[37]  Jiabin Wang,et al.  A Study of User Performance and Attitudes with Information Retrieval Interfaces , 1995, J. Am. Soc. Inf. Sci..

[38]  Dietmar Wolfram,et al.  Searcher Response in a Hypertext-Based Bibliographic Information Retrieval System , 1995, J. Am. Soc. Inf. Sci..

[39]  C. J. van Rijsbergen,et al.  The use of hierarchic clustering in information retrieval , 1971, Inf. Storage Retr..

[40]  Chris D. Paice,et al.  Method for Evaluation of Stemming Algorithms Based on Error Counting , 1996, J. Am. Soc. Inf. Sci..

[41]  Martin Gogolla,et al.  Conceptual modelling of database applications using extended ER model , 1992, Data Knowl. Eng..

[42]  Daniel G. Shapiro,et al.  RUBRIC: A System for Rule-Based Information Retrieval , 1985, IEEE Transactions on Software Engineering.

[43]  Jaime G. Carbonell,et al.  CoalSORT: A Knowledge-Based Interface , 1987, IEEE Expert.

[44]  R. P. Channing Rodgers Automated retrieval from multiple disparate information sources: The World Wide Web and the NLM's sourcerer project , 1995 .

[45]  Jonathan Furner Digital images in libraries: an overview , 1997 .

[46]  Robert F. Simmons,et al.  A text knowledge base from the AI handbook , 1983, Inf. Process. Manag..

[47]  Brenda M. Rimmer Derwent Publications Ltd. , 1988 .

[48]  Susan Gauch,et al.  Search improvement via automatic query reformulation , 1991, TOIS.

[49]  Philip J. Smith,et al.  Knowledge-Based Search Tactics , 1993, Inf. Process. Manag..

[50]  Stephen E. Robertson,et al.  Interactive Thesaurus Navigation: Intelligence Rules OK? , 1995, J. Am. Soc. Inf. Sci..

[51]  Chris D. Paice,et al.  Another stemmer , 1990, SIGF.

[52]  Lisa F. Rau,et al.  Knowledge organization and access in a conceptual information system , 1987, Inf. Process. Manag..

[53]  Philippe Aigrain,et al.  A model for the evaluation of expansion techniques in information retrieval systems , 1994 .

[54]  Micheline Hancock-Beaulieu,et al.  An Evaluation of Interactive Query Expansion in an Online Library Catalogue with a Graphical User Interface , 1995, J. Documentation.

[55]  Reginald Ferber,et al.  An Associative Model of Word Selection in the Generation of Search Queries , 1995, J. Am. Soc. Inf. Sci..

[56]  Hiroyuki Shinnou Redefining similarity in a thesaurus by using corpora , 1996, COLING.

[57]  George W. Adamson,et al.  The use of an association measure based on character structure to identify semantically related pairs of words and document titles , 1974, Inf. Storage Retr..

[58]  Myoung-Ho Kim,et al.  Ranking Documents in Thesaurus-Based Boolean Retrieval Systems , 1994, Inf. Process. Manag..

[59]  Tamas E. Doszkocs,et al.  CITE NLM: natural-language searching in an online catalog , 1983 .

[60]  M. F. Porter,et al.  An algorithm for suffix stripping , 1997 .

[61]  James J. Cimino,et al.  Vocabulary and Health Care Information Technology: State of the Art , 1995 .

[62]  E. H. Hutten SEMANTICS , 1953, The British Journal for the Philosophy of Science.

[63]  Carlo Vernimb,et al.  Automatic query adjustment in document retrieval , 1977, Inf. Process. Manag..

[64]  Chris D. Paice,et al.  Expert systems for information retrieval , 1986 .

[65]  A. S. Pollitt Intelligent interfaces to text retrieval systems , 1990 .

[66]  Mark H. Chignell,et al.  Knowledge-based search tactics for an intelligent intermediary system , 1989, TOIS.

[67]  Roy Rada,et al.  A Graphical Thesaurus-Based Information Retrieval System , 1989, Int. J. Man Mach. Stud..

[68]  Jonathan Furner IR on the Web : an overview , 1996 .