A Method for Measuring Wide Range Performance of Boolean Queries in Full-Text Databases

.................................................................................................................................................. 5 PREFACE ...................................................................................................................................................... 6 CONTENTS ................................................................................................................................................... 9 LIST OF SYMBOLS................................................................................................................................... 13

[1]  Donna K. Harman,et al.  Overview of the First Text REtrieval Conference (TREC-1) , 1992, TREC.

[2]  Nicholas J. Belkin,et al.  Retrieval techniques , 1987 .

[3]  Helen R. Tibbo,et al.  Freestyle vs. Boolean: A Comparison of Partial and Exact Match Retrieval Systems , 1998, Inf. Process. Manag..

[4]  Jaana Kristensen,et al.  Expanding End-Users' Query Statements for Free Text Searching with a Search-Aid Thesaurus , 1993, Inf. Process. Manag..

[5]  Stephen P. Harter,et al.  Search term combinations and retrieval overlap: A proposed methodology and case study , 1990, J. Am. Soc. Inf. Sci..

[6]  M. Fisher Worst-Case Analysis of Heuristic Algorithms , 1980 .

[7]  Brian C. O'Connor,et al.  Language and representation in information retrieval , 1993 .

[8]  Timo Niemi,et al.  A deductive data model for query expansion , 1996, SIGIR '96.

[9]  Mario Bunge,et al.  Scientific Research I , 1967 .

[10]  Carol Tenopir,et al.  Magazines in Full Text: Uses and Search Strategies. , 1989 .

[11]  Donna Harman,et al.  The fourth text REtrieval conference , 1996 .

[12]  Nicholas J. Belkin,et al.  Using Relevance Feedback and Ranking in Interactive Searching , 1995, TREC.

[13]  E. A. Fox,et al.  Combining the Evidence of Multiple Query Representations for Information Retrieval , 1995, Inf. Process. Manag..

[14]  David A. Hull Using statistical testing in the evaluation of retrieval experiments , 1993, SIGIR.

[15]  Mirja Iivonen,et al.  Consistency in the Selection of Search Concepts and Search Terms , 1995, Information Processing & Management.

[16]  Paul B. Kantor,et al.  A study of information seeking and retrieving. II. Users, questions, and effectiveness , 1988, J. Am. Soc. Inf. Sci..

[17]  Jean Tague-Sutcliffe,et al.  An investigation of the optimization of search logic for the MEDLINE database , 1991, J. Am. Soc. Inf. Sci..

[18]  Jaana Kekäläinen,et al.  The IR Game - A Tool for Rapid Query Analysis in Cross-Language IR Experiments , 1998 .

[19]  Howard R. Turtle Natural language vs. Boolean query evaluation: a comparison of retrieval performance , 1994, SIGIR '94.

[20]  F. W. Lancaster,et al.  MEDLARS: Report on the Evaluation of Its Operating Efficiency. , 1997 .

[21]  Donald E. Polkinghorne,et al.  Methodology for the human sciences , 1983 .

[22]  F. W. Lancaster,et al.  Information Retrieval Today , 1993 .

[23]  E. Michael Keen,et al.  Presenting Results of Experimental Retrieval Comparisons , 1997, Inf. Process. Manag..

[24]  David Hawking,et al.  Overview of TREC-7 Very Large Collection Track , 1997, TREC.

[25]  Robert H. Ledwith On the Difficulties of Applying the Results of Information Retrieval Research to Aid in the Searching of Larg Scientific Databases , 1992, Inf. Process. Manag..

[26]  Raya Fidel,et al.  Searchers' selection of search keys: II. Controlled vocabulary or free‐text searching , 1991 .

[27]  Eero Sormunen An analysis of online searching knowledge for intermediary systems , 1989 .

[28]  E. Michael Keen Some aspects of proximity searching in text retrieval systems , 1992, J. Inf. Sci..

[29]  John Weiner,et al.  Letter to the Editor , 1992, SIGIR Forum.

[30]  Dagobert Soergel Indexing and retrieval performance: the logical evidence , 1994 .

[31]  Joyce A. Mitchell,et al.  The Medline/full-text research project , 1991, J. Am. Soc. Inf. Sci..

[32]  W. Rozeboom,et al.  Methodology: Foundations of Inference and Research in the Behavioral Sciences. , 1971 .

[33]  Philip J. Smith,et al.  Knowledge-Based Search Tactics , 1993, Inf. Process. Manag..

[34]  Raya Fidel,et al.  Factors affecting online bibliographic retrieval: A conceptual framework for research , 1983, J. Am. Soc. Inf. Sci..

[35]  Abraham Kaplan,et al.  The Conduct of Inquiry: Methodology for Behavioural Science , 1965 .

[36]  David A. Hull Stemming algorithms: a case study for detailed evaluation , 1996 .

[37]  P. Willett,et al.  An Introduction to Algorithmic and Cognitive Approaches for Information Retrieval , 1995 .

[38]  Bernice W. Polemis Nonparametric Statistics for the Behavioral Sciences , 1959 .

[39]  Gerard Salton,et al.  A Simple Blueprint for Automatic Boolean Query Processing , 1988, Inf. Process. Manag..

[40]  David C. Blair STAIRS redux: thoughts on the STAIRS evaluation, ten years after , 1996 .

[41]  Allen Newell,et al.  Heuristic programming: ill-structured problems , 1993 .

[42]  Gerard Salton,et al.  A new comparison between conventional indexing (MEDLARS) and automatic text processing (SMART) , 1972, J. Am. Soc. Inf. Sci..

[43]  Jean Tague-Sutcliffe,et al.  The Pragmatics of Information Retrieval Experimentation Revisited , 1997, Inf. Process. Manag..

[44]  Cyril Cleverdon,et al.  The Cranfield tests on index language devices , 1997 .

[45]  Jens Rasmussen,et al.  Effectiveness testing of complex systems , 1997 .

[46]  M. E. Maron,et al.  An evaluation of retrieval effectiveness for a full-text document-retrieval system , 1985, CACM.

[47]  Jacob Shapiro,et al.  Algorithm for automatic construction of query formulations in Boolean form , 1991, J. Am. Soc. Inf. Sci..

[48]  Robert M. Losee,et al.  Upper Bounds for Retrieval Performance and Their Use Measuring Performance and Generating Optimal Boolean Queries: Can It Get Any Better Than This? , 1994, Inf. Process. Manag..

[49]  M. Butler Information Retrieval Systems Characteristics, Testing, and Evaluation , 1970 .

[50]  Tefko Saracevic,et al.  Evaluation of evaluation in information retrieval , 1995, SIGIR '95.

[51]  James C. French,et al.  A Classification Approach to Boolean Query Reformulation , 1997, J. Am. Soc. Inf. Sci..

[52]  William R. Hersh,et al.  An Evaluation of Interactive Boolean and Natural Language Searching with an Online Medical Textbook , 1995, J. Am. Soc. Inf. Sci..

[53]  Raya Fidel Towards Expert Systems for the Selection of Search Keys , 1986 .

[54]  C. Cleverdon On the Inverse Relationship of Recall and Precision. , 1972 .

[55]  Stephen P. Harter,et al.  Heuristics for Online Information Retrieval: A Typology and Preliminary Listing. , 1985 .

[56]  Mirja Iivonen,et al.  Searchers and searchers: differences between the most and least consistent searches , 1995, SIGIR '95.

[57]  Raya Fidel,et al.  Moves in online searching , 1985 .

[58]  Carol Tenopir Online information retrieval: An introductory manual to principles and practice (4th edition) , 1994 .

[59]  K. Sparck Jones,et al.  INFORMATION RETRIEVAL TEST COLLECTIONS , 1976 .

[60]  Stephen P. Harter,et al.  Evaluation of information retrieval systems : Approaches, issues, and methods , 1997 .

[61]  Gerard Salton,et al.  Another look at automatic text-retrieval systems , 1986, CACM.

[62]  John Convey On-Line Information Retrieval Systems: An Introductory Manual to Principles and Practice , 1977 .

[63]  Jaana Kekäläinen,et al.  The impact of query structure and query expansion on retrieval performance , 1998, SIGIR '98.

[64]  Ellen M. Voorhees,et al.  The fifth text REtrieval conference (TREC-5) , 1997 .

[65]  Michael Kluck German Indexing and Retrieval Test Data Base (GIRT) - Some Results of the Pre-test , 1998, BCS-IRSG Annual Colloquium on IR Research.

[66]  William M. Shaw,et al.  Termrelevance Computations and Perfect Retrieval Performance , 1995, Inf. Process. Manag..

[67]  Donald T. Hawkins,et al.  Online Bibliographic Search Strategy Development. , 1982 .

[68]  F. W. Lancaster,et al.  Information retrieval: on-line , 1973 .

[69]  Donna Harman,et al.  The First Text REtrieval Conference (TREC-1) , 1993 .

[70]  Venkata Subramaniam,et al.  Information Retrieval: Data Structures & Algorithms , 1992 .

[71]  A. Kaplan The Conduct of Inquiry: Methodology for Behavioural Science , 1965 .

[72]  Robert M. Losee,et al.  Text Retrieval and Filtering , 1998, The Information Retrieval Series.

[73]  Ricardo Baeza-Yates,et al.  Information Retrieval: Data Structures and Algorithms , 1992 .

[74]  David C. Blair Full text retrieval: evaluation and implications , 1986 .

[75]  G.J.A. Riesthuis Subject analysis and indexing [Review of: R. Fugmann (1995) -] , 1995 .

[76]  Ari Pirkola,et al.  Studies on Linguistic Problems and Methods in Text Retrieval: The Effects of Anaphor and Ellipsis Resolution in Proximity Searching, and Translation and Query Structuring Methods in Cross-Language Retrieval , 1999 .

[77]  Martin Smith,et al.  The use of genetic programming to build Boolean queries for text retrieval through relevance feedback , 1997, J. Inf. Sci..

[78]  Carol Tenopir,et al.  Full text database retrieval performance , 1985 .

[79]  Alfred V. Aho,et al.  Foundations of Computer Science , 1979, Lecture Notes in Computer Science.

[80]  M. E. Maron,et al.  Full-text information retrieval: Further analysis and clarification , 1990, Inf. Process. Manag..

[81]  Raya Fidel,et al.  Online searching styles: A case-study-based model of searching behavior , 1984, J. Am. Soc. Inf. Sci..

[82]  Peter Ingwersen,et al.  Cognitive Perspectives of Information Retrieval Interaction: Elements of a Cognitive IR Theory , 1996, J. Documentation.

[83]  Vladimir G. Voiskunskii,et al.  Boolean Search: Current State and Perspectives , 1999, J. Am. Soc. Inf. Sci..

[84]  Carol Tenopir,et al.  Full text databases , 1990 .

[85]  Jaana Kekäläinen,et al.  The effects of query complexity, expansion and structure on retrieval performance in probabilistic text retrieval , 1999 .

[86]  Stephen E. Robertson,et al.  Relevance weighting of search terms , 1976, J. Am. Soc. Inf. Sci..

[87]  Anita Sundaram,et al.  Information Retrieval: A Health Care Perspective , 1996 .

[88]  Stephen P. Harter,et al.  Online Information Retrieval: Concepts, Principles and Techniques , 1986 .

[89]  Paolo Toth,et al.  Knapsack Problems: Algorithms and Computer Implementations , 1990 .

[90]  John D. Holt,et al.  Boolean System Revisited: Its Performance and its Behavior , 1995, TREC.

[91]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[92]  Ari Pirkola,et al.  The effects of query structure and dictionary setups in dictionary-based cross-language information retrieval , 1998, SIGIR '98.