SIFT - a Tool for Wide-Area Information Dissemination

The dissemination model is becoming increasingly important in wide-area information systems. In this model, the user subscribes to an information dissemination service by submitting profiles that describe their interests. The user then passively receives new, filtered information. The Stanford Information Filtering Tool (SIFT) is a tool to help provide such a service. It supports full-text filtering using well-known information retrieval models. The SIFT filtering engine implements novel indexing techniques that allow it to process large volumes of information against a large number of profiles. It runs on several major Unix platforms and is freely available to the public. In this paper we present SIFT's approach to user interest modeling and user-server communication. We demonstrate the processing capability of SIFT by describing a running server that disseminates USENET News. We present an empirical study of SIFT's performance, examining its main memory requirements and its ability to scale with information volume and user population.
