Rama: An architecture for Internet information filtering

This paper describes Rama, a first generation experimental information retrieval and filtering system that attempts to recover useful information from various Internet sources including USENIX news and anonymous FTP servers. The focus of the Rama system to date has been on building a distributed query and information retrieval system, which provides an interface to heterogeneous information services. A user of Rama sends one or more asynchronous queries to a Rama server using existing SMTP e-mail clients. The server periodically searches local and remote Internet services. Searches are prefiltered with the use of timestamps. Data objects which are newer than the timestamp are then searched via a query mechanism which relies on a combination of vector-distance, pattern matching operands, and boolean operators. Results are weighted according to how closely they match queries and are posted via e-mail to the user. Input to the e-mail client can be further filtered — one can use the MH mail system and sort input by weight. Results indicate that the current system is useful and extensible. So far we have assumed that existing e-mail systems will be used for input and output and have not attempted to construct special client interfaces. Efforts are underway to extend the system with WWW searching capabilities and construct a special WWW oriented user-interface.

[1]  Arthur Charles Clarke,et al.  Rendezvous with Rama , 1973 .

[2]  Ari Luotonen,et al.  World-Wide Web Proxies , 1994, Comput. Networks ISDN Syst..

[3]  Hector Garcia-Molina,et al.  SIFT - a Tool for Wide-Area Information Dissemination , 1995, USENIX.

[4]  J. Postel,et al.  File transfer protocol (FTP) , 1985 .

[5]  Ralph E. Droms,et al.  The Knowbot Information Service , 1989 .

[6]  Nathaniel S. Borenstein,et al.  MIME (Multipurpose Internet Mail Extensions) Part One: Mechanisms for Specifying and Describing the Format of Internet Message Bodies , 1992, RFC.

[7]  Darren R. Hardy,et al.  Essence: A Resource Discovery System Based on Semantic File Indexing , 1993, USENIX Winter.

[8]  Calton Pu,et al.  Applying an information gathering architecture to Netfind: a white pages tool for a changing and growing Internet , 1994, TNET.

[9]  Pattie Maes,et al.  Evolving agents for personalized information filtering , 1993, Proceedings of 9th IEEE Conference on Artificial Intelligence for Applications.

[10]  Pattie Maes,et al.  Learning Interface Agents , 1993, AAAI.

[11]  Brewster Kahle,et al.  An information system for corporate users: wide area information servers , 1991 .

[12]  Peter B. Danzig,et al.  Harvest: A Scalable, Customizable Discovery and Access System , 1994 .

[13]  Gerard Salton,et al.  Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer , 1989 .

[14]  Arthur Charles Clarke The Fountains of Paradise , 1979 .

[15]  D. B. Chapan Majordomo : How I Manage 17 Mailing Lists Without Answering "-request" Mail , 1992 .

[16]  Jonathan B. Postel Simple Mail Transfer Protocol-SMTP , 1992 .