A Simple Blueprint for Automatic Boolean Query Processing

Abstract The operations of conventional information retrieval systems are based on the use of Boolean query formulations designed to reflect user needs. Unfortunately, many retrieval system users are not able to construct useful Boolean expressions, and trained search intermediaries must normally help in the search formulation process. In this article, a new Boolean retrieval environment is outlined in which the queries are automatically constructed from original natural language formulations provided by the users. A soft Boolean logic is introduced that relaxes the interpretation of the Boolean operators and provides much enhanced retrieval effectiveness. All the proposed processing modifications are compatible with conventional inverted file technologies, and with normal operational retrieval methodologies.

[1]  Karen Spärck Jones Experiments in relevance weighting of search terms , 1979, Inf. Process. Manag..

[2]  Donald H. Kraft,et al.  TIRS: a topological information retrieval system satisfying the requirements of the Waller-Kraft wish list , 1987, SIGIR '87.

[3]  Clement T. Yu,et al.  The measurement of term importance in automatic indexing , 1981, J. Am. Soc. Inf. Sci..

[4]  P. C. Wong,et al.  Generalized vector spaces model in information retrieval , 1985, SIGIR '85.

[5]  Valiollah Tahani,et al.  A fuzzy model of document retrieval systems , 1976, Inf. Process. Manag..

[6]  Karen Sparck Jones A statistical interpretation of term specificity and its application in retrieval , 1972 .

[7]  Abraham Bookstein,et al.  Fuzzy requests: An approach to weighted boolean searches , 1980, J. Am. Soc. Inf. Sci..

[8]  Tadeusz Radecki,et al.  Fuzzy set theoretical approach to document retrieval , 1979, Inf. Process. Manag..

[9]  Abraham Bookstein On the perils of merging boolean and weighted retrieval systems , 1978, J. Am. Soc. Inf. Sci..

[10]  J. J. Rocchio,et al.  Relevance feedback in information retrieval , 1971 .

[11]  Harry Wu On query formulation in information retrieval , 1981 .

[12]  Gerard Salton,et al.  A blueprint for automatic indexing , 1981, SIGF.

[13]  Tadeusz Radecki,et al.  Incorporation of Relevance Feedback into Boolean Retrieval System , 1982, SIGIR.

[14]  Vijay V. Raghavan,et al.  Extended Boolean query processing in the generalized vector space model , 1989, Inf. Syst..

[15]  Stephen E. Robertson,et al.  Relevance weighting of search terms , 1976, J. Am. Soc. Inf. Sci..

[16]  Tadeusz Radecki A probabilistic approach to information retrieval in systems with boolean search request formulations , 1982, J. Am. Soc. Inf. Sci..

[17]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[18]  Edward A. Fox,et al.  Automatic query formulations in information retrieval , 1983, J. Am. Soc. Inf. Sci..

[19]  Edward Fox,et al.  Extending the boolean and vector space models of information retrieval with p-norm queries and multiple concept types , 1983 .

[20]  Clement T. Yu,et al.  A theory of term importance in automatic text analysis , 1974, J. Am. Soc. Inf. Sci..

[21]  Edward A. Fox,et al.  Research Contributions , 2014 .

[22]  Donald H. Kraft,et al.  A mathematical model of a weighted boolean retrieval system , 1979, Inf. Process. Manag..

[23]  Edward A. Fox,et al.  Advanced feedback methods in information retrieval , 1985, J. Am. Soc. Inf. Sci..

[24]  Terry Noreault,et al.  Automatic ranked output from boolean searches in SIRE , 1977, J. Am. Soc. Inf. Sci..

[25]  C. Paice Soft evaluation of Boolean search queries in information retrieval systems , 1984 .