Practical enhanced Boolean retrieval: Experiences with the smart and sire systems

Abstract During the last decade, studies with the SMART and SIRE systems have pioneered new techniques for improving the effectiveness of Boolean retrieval. Extended Boolean logic, automatic Boolean query construction, and Boolean feedback yield significant improvements according to a variety of experiments with SMART. Ranking of the output of Boolean queries has been shown to be of value with SIRE. Recent efforts have aimed at adapting SMART to allow large scale testing of these advanced retrieval methods. SIRE has been enhanced to include the p-norm scheme for extended Boolean query processing. Implementation and performance details about SMART and SIRE illustrate how commercially available retrieval systems can be improved to use the fruits of these research efforts.

[1]  Tadeusz Radecki,et al.  Incorporation of Relevance Feedback into Boolean Retrieval System , 1982, SIGIR.

[2]  Michael McGill,et al.  A performance evaluation of similarity measures, document term weighting schemes and representations in a Boolean environment , 1980, SIGIR '80.

[3]  Gerard Salton,et al.  Research and Development in Information Retrieval , 1982, Lecture Notes in Computer Science.

[4]  Etienne Kerre,et al.  The use of fuzzy set theory in information retrieval and databases: A survey , 1986, J. Am. Soc. Inf. Sci..

[5]  Glenn Shafer,et al.  A Mathematical Theory of Evidence , 2020, A Mathematical Theory of Evidence.

[6]  Duncan A. Buell A problem in information retrieval with fuzzy sets , 1985, J. Am. Soc. Inf. Sci..

[7]  Edward A. Fox,et al.  Advanced feedback methods in information retrieval , 1985, J. Am. Soc. Inf. Sci..

[8]  Gerard Salton,et al.  A vector space model for automatic indexing , 1975, CACM.

[9]  Tadeusz Radecki Mathematical model of time-effective information retrieval system based on the theory of fuzzy sets , 1977, Inf. Process. Manag..

[10]  Carol Tenopir,et al.  Full text databases , 1990 .

[11]  Aviezri S. Fraenkel,et al.  Local Feedback in Full-Text Retrieval Systems , 1977, JACM.

[12]  Edward A. Fox,et al.  Research Contributions , 2014 .

[13]  Daniel G. Shapiro,et al.  A Rule-Based Approach to Information Retrieval: Some Results and Comments , 1983, AAAI.

[14]  Yaacov Choueka,et al.  Computerized full-text retrieval systems and research in the humanities: The responsa project , 1980, Computers and the Humanities.

[15]  Daniel G. Shapiro,et al.  RUBRIC: A System for Rule-Based Information Retrieval , 1985, IEEE Transactions on Software Engineering.

[16]  Nicholas J. Belkin,et al.  Simulation of a distributed expert-based information provision mechanism , 1984 .

[17]  Edward A. Fox,et al.  Some Considerations for Implementing the SMART Information Retrieval System Under UNIX , 1983 .

[18]  Tadeusz Radecki Reducing the Perils of Merging Boolean and Weighted Retrieval Systems , 1982, J. Documentation.

[19]  Martin Dillon,et al.  The Use of Automatic Relevance feedback in Boolean Retrieval Systems , 1980, J. Documentation.

[20]  M. E. Maron,et al.  An evaluation of retrieval effectiveness for a full-text document-retrieval system , 1985, CACM.

[21]  Michael McGill,et al.  An Evaluation of Factors Affecting Document Ranking by Information Retrieval Systems. , 1979 .

[22]  Tadeusz Radecki Generalized Boolean Methods of Information Retrieval , 1983, Int. J. Man Mach. Stud..

[23]  Harold Borko,et al.  Indexing concepts and methods , 1978 .

[24]  Edward A. Fox,et al.  Development of the coder system: A testbed for artificial intelligence methods in information retrieval , 1987, Inf. Process. Manag..

[25]  Edward A. Fox,et al.  Automatic query formulations in information retrieval , 1983, J. Am. Soc. Inf. Sci..

[26]  N. Rescher Many Valued Logic , 1969 .

[27]  Stephen E. Robertson,et al.  Relevance weighting of search terms , 1976, J. Am. Soc. Inf. Sci..

[28]  W. Bruce Croft Boolean Queries and Term Dependencies in Probabilistic Retrieval Models. , 1986 .

[29]  Tadeusz Radecki A probabilistic approach to information retrieval in systems with boolean search request formulations , 1982, J. Am. Soc. Inf. Sci..

[30]  Joan M. Morrissey,et al.  An Intelligent Terminal for Implementing Relevance Feedback on Large Operational Retrieval Systems , 1982, SIGIR.

[31]  Abraham Bookstein,et al.  Fuzzy requests: An approach to weighted boolean searches , 1980, J. Am. Soc. Inf. Sci..

[32]  C. Paice Soft evaluation of Boolean search queries in information retrieval systems , 1984 .

[33]  Terry Noreault,et al.  Automatic ranked output from boolean searches in SIRE , 1977, J. Am. Soc. Inf. Sci..

[34]  M. Lynne Neufeld,et al.  Database history: from dinosaurs to compact discs , 1986 .

[35]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[36]  Valiollah Tahani,et al.  A fuzzy model of document retrieval systems , 1976, Inf. Process. Manag..

[37]  Edward A. Fox,et al.  Implementing SMART for minicomputers via relational processing With abstract data types , 1981, SIGSMALL '81.

[38]  A. Kandel Fuzzy Mathematical Techniques With Applications , 1986 .

[39]  Edward A. Fox,et al.  UNIX Micros for Students Majoring in Computer Science and Personal Information Retrieval. , 1986 .

[40]  Chris Buckley,et al.  Implementation of the SMART Information Retrieval System , 1985 .

[41]  Charles T. Meadow,et al.  Basics of online searching , 1981 .

[42]  Stephen Robertson,et al.  An algorithm for weighted searching on a Boolean system , 1984 .

[43]  Gerard Salton,et al.  Another look at automatic text-retrieval systems , 1986, CACM.

[44]  Edward A. Fox,et al.  Composite document extended retrieval: an overview , 1985, SIGIR '85.

[45]  Edward A. Fox,et al.  A comparison of two methods for boolean query relevancy feedback , 1984, Inf. Process. Manag..

[46]  Peter C. Cheeseman,et al.  In Defense of Probability , 1985, IJCAI.

[47]  Donald H. Kraft,et al.  A model for a weighted retrieval system , 1981, J. Am. Soc. Inf. Sci..

[48]  Edward A. Fox,et al.  An Automatic Environment for Boolean Information Retrival , 1983, IFIP Congress.

[49]  Tadeusz Radecki Mathematical model of information retrieval system based on the concept of Fuzzy thesaurus , 1976, Inf. Process. Manag..

[50]  Tamas E. Doszkocs From Research to Application: The Cite Natural Language Information System , 1982, SIGIR.

[51]  J. J. Rocchio,et al.  Relevance feedback in information retrieval , 1971 .

[52]  E. Fox,et al.  A Comparison of Two Methods For Soft Boolean Operator Interpretation In Information Retrieval , 1986 .

[53]  Tadeusz Radecki,et al.  Probabilistic methods for ranking output documents in conventional Boolean retrieval systems , 1988, Inf. Process. Manag..

[54]  Julie Beth Lovins,et al.  Development of a stemming algorithm , 1968, Mech. Transl. Comput. Linguistics.

[55]  Chris Buckley,et al.  Optimization of inverted vector searches , 1985, SIGIR '85.

[56]  Michael McGill,et al.  Syracuse information retrieval experiment (SIRE): design of an on-line bibliographic retrieval system , 1976, SIGF.

[57]  Jeffrey Katzer,et al.  A study of the overlap among document representations , 1983, SIGIR '83.

[58]  Edward Fox,et al.  Extending the boolean and vector space models of information retrieval with p-norm queries and multiple concept types , 1983 .

[59]  Edward A. Fox,et al.  Architecture of an expert system for composite document analysis, representation, and retrieval , 1997, Int. J. Approx. Reason..

[60]  Clement T. Yu,et al.  A theory of term importance in automatic text analysis , 1974, J. Am. Soc. Inf. Sci..