Advances in Information Retrieval: Where Is That /#*&@¢ Record?

Publisher Summary: This chapter discusses information retrieval systems. Much effort goes into determining where to store information and how to retrieve it efficiently and effectively. Information retrieval systems form a special class of information systems, alongside database management and question-answering systems. The key element that distinguishes them is the concept that the relevance of any given record to any specific query cannot be determined exactly. In recent years, many commercial systems and theoretical approaches have emerged. Nevertheless, information retrieval remains an interesting and challenging area of study and application, and one currently in great flux. The chapter begins with a discussion of the various types of information systems that relate to information retrieval, followed by a general discussion of some systems currently available in the marketplace. It then presents work in the areas of content analysis (including natural language applications), query processing, and systems evaluation. Finally, it presents current research efforts in related areas such as artificial intelligence and fuzzy subset theory.
