Experiments on incorporating syntactic processing of user queries into a document retrieval strategy

Traditional information has relied on the extensive use of statistical parameters in the implementation of retrieval strategies. This paper sets out to investigate whether linguistic processes can be used as part of a document retrieval strategy. This is done by predefining a level of syntactic analysis of user queries only, to be used as part of the retrieval process. A large series of experiments on an experimental test collection are reported which use a parser for noun phrases as part of the retrieval strategy. The results obtained from the experiments do yield improvements in the level of retrieval effectiveness and given the crude linguistic process used and the way it was used on queries and not on document texts, suggests that the approach of using linguistic processing in retrieval, is valid.

[1]  Bruno Defude Different levels of expertise for an expert system in information retrieval , 1985, SIGIR '85.

[2]  Claudia Marcus Prolog Programming , 1986 .

[3]  Naomi Sager,et al.  Natural language information processing , 1980 .

[4]  David H. D. Warren,et al.  Definite Clause Grammars for Language Analysis - A Survey of the Formalism and a Comparison with Augmented Transition Networks , 1980, Artif. Intell..

[5]  Martin Dillon,et al.  FASIT: A fully automatic syntactically based indexing system , 1983, J. Am. Soc. Inf. Sci..

[6]  Martin Dillon,et al.  Fully Automatic Book Indexing , 1983, J. Documentation.

[7]  Gregor Thurmair A common architecture for different text processing techniques in an information retrieval environment , 1986, SIGIR '86.

[8]  Martin Chodorow,et al.  The EPISTLE Text-Critiquing System , 1982, IBM Syst. J..

[9]  Joel L. Fagan The effectiveness of a nonsyntatic approach to automatic phrase indexing for document retrieval , 1989 .

[10]  Dario De Jaco,et al.  An information retrieval system based on artificial intelligence techniques , 1986, SIGIR '86.

[11]  Ralph Grishman Natural language processing , 1984, J. Am. Soc. Inf. Sci..

[12]  David H. D. Warren,et al.  An Efficient Easily Adaptable System for Interpreting Natural Language Queries , 1982, CL.

[13]  Edward A. Fox,et al.  Characterization of Two New Experimental Collections in Computer and Information Science Containing Textual and Bibliographic Concepts , 1983 .

[14]  Carol Friedman,et al.  Transporting the linguistic string project system from a medical to a Navy domain , 1985, TOIS.

[15]  Gerard Salton,et al.  A note on information retrieval models and theories , 1985, RIAO.

[16]  Mihai Nadin T. Winograd, Language as a Cognitive Process, Volume I: Syntax , 1985, Artif. Intell..

[17]  Mark Wallace Communicating with databases in natural language , 1984 .

[18]  Tamas E. Doszkocs Natural language processing in intelligent information retrieval , 1985, ACM '85.

[19]  Claudia Marcus Prolog programming: applications for database systems, expert systems, and natural language systems , 1986 .

[20]  Alan F. Smeaton,et al.  Incorporating syntactic information into a document retrieval strategy: an investigation , 1986, SIGIR '86.

[21]  Terry Winograd,et al.  Language as a Cognitive Process , 1983, CL.

[22]  Joel L Fagan,et al.  Experiments in Automatic Phrase Indexing For Document Retrieval: A Comparison of Syntactic and Non-Syntactic Methods , 1987 .

[23]  Terry Winograd,et al.  Language as a cognitive process 1: Syntax , 1982 .

[24]  Gerard Salton,et al.  Recent trends in automatic information retrieval , 1986, SIGIR '86.

[25]  Naomi Sager,et al.  Natural Language Information Processing: A Computer Grammar of English and Its Applications , 1980 .

[26]  Michael E. Lesk Information in data: using the Oxford English dictionary on a computer , 1986, SIGF.

[27]  Jaime G. Carbonell,et al.  A tutorial on techniques and applications for natural language processing , 1983 .

[28]  Michael C. McCord,et al.  Using Slots and Modifiers in Logic Grammars for Natural Language , 1982, Artif. Intell..