A real-time retrieval system using change aware distributed file system

Intranet information retrieval is very important to discover the useful knowledge. In this process, search engine is useful. However, conventional search engines, which are based on centralized architecture, are not suited for the intranet information retrieval because intranet information is frequently updated. Centralized search engines take a long time to collect Web pages by robots. So, we have developed a distributed search engine, cooperative search engine (CSE), to retrieve fresh information. In CSE, a local search engine located in each Web server makes an index of local pages. And, a meta search server integrates these local search engines to realize a global search engine. CSE takes a few minutes to update index because each local search engine scans all files in its own disk. In this paper, we propose the notification mechanism using change aware distributed file system. As this result, we realize the real-time information retrieval.

[1]  Minoru Uehara,et al.  Reliable distributed search engine based on multiple meta servers , 2002, First International Symposium on Cyber Worlds, 2002. Proceedings..

[2]  Jim Fullton,et al.  Architecture of the Whois++ Index Service , 1996, RFC.

[3]  Peter B. Danzig,et al.  The Harvest Information Discovery and Access System , 1995, Comput. Networks ISDN Syst..

[4]  Minoru Uehara,et al.  Persistent cache in Cooperative Search Engine , 2002, Proceedings 22nd International Conference on Distributed Computing Systems Workshops.

[5]  Minoru Uehara A distributed file system for Java Applet based distance learning , 2004, 2004 International Symposium on Applications and the Internet. Proceedings..

[6]  Minoru Uehara,et al.  Query based site selection for distributed search engines , 2003, 23rd International Conference on Distributed Computing Systems Workshops, 2003. Proceedings..

[7]  Minoru Uehara,et al.  Fresh Information Retrieval Using Cooperative Meta Search Engines , 2002, ICOIN.

[8]  David Robinson,et al.  NFS version 4 Protocol , 2000, RFC.