Abstract

Even experienced users of information retrieval (IR) systems are often frustrated when searching for information on the World Wide Web, in part because current search engines concentrate on speed and coverage at the expense of precision. In this paper, we describe an approach to increasing retrieval precision by filtering out irrelevant material. Potentially relevant matches obtained from a standard Web search engine are filtered using, for example, augmented patterns derived from the syntactic structure inherent in natural language text. We argue that the performance of these and other methods of filtering for IR can be improved through server-side scripting, a concept that has not yet been exploited. We describe an implementation of such a system and discuss issues that arise out of this model of improving IR. We conclude with a discussion of areas where this mode of filtering is most appropriate.