Design and evaluation of a multi-agent collaborative Web mining system

Most existing Web search tools work only with individual users and do not help a user benefit from previous search experiences of others. In this paper, we present the Collaborative Spider, a multi-agent system designed to provide post-retrieval analysis and enable across-user collaboration in Web search and mining. This system allows the user to annotate search sessions and share them with other users. We also report a user study designed to evaluate the effectiveness of this system. Our experimental findings show that subjects' search performance was degraded, compared to individual search scenarios in which users had no access to previous searches, when they had access to a limited number (e.g., 1 or 2) of earlier search sessions done by other users. However, search performance improved significantly when subjects had access to more search sessions. This indicates that gain from collaboration through collaborative Web searching and analysis does not outweigh the overhead of browsing and comprehending other users' past searches until a certain number of shared sessions have been reached. In this paper, we also catalog and analyze several different types of user collaboration behavior observed in the context of Web mining.

[1]  Yoav Shoham,et al.  Fab: content-based, collaborative recommendation , 1997, CACM.

[2]  Timothy W. Finin,et al.  KQML as an agent communication language , 1994, CIKM '94.

[3]  Steven J. Plimpton,et al.  Massively parallel methods for engineering and science problems , 1994, CACM.

[4]  Hsinchun Chen,et al.  Comparing noun phrasing techniques for use with medical digital library tools , 2000, J. Am. Soc. Inf. Sci..

[5]  Jay F. Nunamaker,et al.  Verifying the Proximity and Size Hypothesis for Self-Organizing Maps , 2000, J. Manag. Inf. Syst..

[6]  Charles J. Petrie,et al.  Agent-Based Engineering, the Web, and Intelligence , 1996, IEEE Expert.

[7]  Ahmad M. Ahmad Wasfi Collecting user access patterns for building user profiles and collaborative filtering , 1998, IUI '99.

[8]  José A. Pino,et al.  A first step to formally evaluate collaborative work , 1997, GROUP.

[9]  Tim Finin,et al.  Using KQML as an agent communication language , 1994, CIKM 1994.

[10]  Ken Lang,et al.  NewsWeeder: Learning to Filter Netnews , 1995, ICML.

[11]  Thorsten Joachims,et al.  WebWatcher : A Learning Apprentice for the World Wide Web , 1995 .

[12]  Marshall Ramsey,et al.  An intelligent personal spider (agent) for dynamic Internet/Intranet searching , 1998, Decis. Support Syst..

[13]  Mark S. Ackerman,et al.  Do-I-Care: a collaborative Web agent , 1996, CHI 1996.

[14]  Pattie Maes,et al.  Agents that reduce work and information overload , 1994, CACM.

[15]  Paul B. Kantor,et al.  Capturing human intelligence in the net , 2000, CACM.

[16]  Nicholas J. Belkin,et al.  Evaluation of a tool for visualization of information retrieval results , 1996, SIGIR '96.

[17]  J. Wyatt Decision support systems. , 2000, Journal of the Royal Society of Medicine.

[18]  Hsinchun Chen,et al.  Personalized spiders for web search and analysis , 2001, JCDL '01.

[19]  Hsinchun Chen,et al.  Collaborative systems: solving the vocabulary problem , 1994, Computer.

[20]  Hector Garcia-Molina,et al.  Efficient Crawling Through URL Ordering , 1998, Comput. Networks.

[21]  C. Lee Giles,et al.  Accessibility of information on the web , 1999, Nature.

[22]  Murat Karamuftuoglu Collaborative Information Retrieval: Toward a Social Informatics View of IR Interaction , 1998, J. Am. Soc. Inf. Sci..

[23]  Oren Etzioni,et al.  The MetaCrawler architecture for resource aggregation on the Web , 1997 .

[24]  Mark Ginsburg,et al.  Annotate! a tool for collaborative information retrieval , 1998, Proceedings Seventh IEEE International Workshop on Enabling Technologies: Infrastucture for Collaborative Enterprises (WET ICE '98) (Cat. No.98TB100253).

[25]  Oren Etzioni,et al.  The World-Wide Web: quagmire or gold mine? , 1996, CACM.

[26]  K. P. Sycara Multiagent systems : Special issue on agents , 1998 .

[27]  Douglas B. Terry,et al.  Using collaborative filtering to weave an information tapestry , 1992, CACM.

[28]  Jay F. Nunamaker,et al.  Collaborative information retrieval environment: integration of information retrieval with group support systems , 1999, Proceedings of the 32nd Annual Hawaii International Conference on Systems Sciences. 1999. HICSS-32. Abstracts and CD-ROM of Full Papers.

[29]  Martin van den Berg,et al.  Focused Crawling: A New Approach to Topic-Specific Web Resource Discovery , 1999, Comput. Networks.

[30]  Hsinchun Chen,et al.  CI Spider: a tool for competitive intelligence on the Web , 2002, Decis. Support Syst..

[31]  Katia P. Sycara,et al.  Coordination of Multiple Intelligent Software Agents , 1996, Int. J. Cooperative Inf. Syst..

[32]  Pattie Maes,et al.  Social information filtering: algorithms for automating “word of mouth” , 1995, CHI '95.

[33]  Hsinchun Chen,et al.  Internet Browsing and Searching: User Evaluations of Category Map and Concept Space Techniques , 1998, J. Am. Soc. Inf. Sci..

[34]  Charles J. Petrie,et al.  JATLite: A Java Agent Infrastructure with Message Routing , 2000, IEEE Internet Comput..

[35]  Johanna D. Moore,et al.  Proceedings of the Conference on Human Factors in Computing Systems , 1989 .

[36]  Alberto RibesAbstract,et al.  Multi agent systems , 2019, Proceedings of the 2005 International Conference on Active Media Technology, 2005. (AMT 2005)..

[37]  Hsinchun Chen,et al.  Intelligent internet searching agent based on hybrid simulated annealing , 2000, Decis. Support Syst..

[38]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[39]  Hsinchun Chen,et al.  MetaSpider: Meta-searching and categorization on the Web , 2001, J. Assoc. Inf. Sci. Technol..

[40]  Nicholas R. Jennings,et al.  A Roadmap of Agent Research and Development , 2004, Autonomous Agents and Multi-Agent Systems.

[41]  Alexander Dekhtyar,et al.  Information Retrieval , 2018, Lecture Notes in Computer Science.

[42]  M. I. Mauldin,et al.  Lycos: design choices in an Internet search service , 1997 .

[43]  Hendrik Blockeel,et al.  Web mining research: a survey , 2000, SKDD.

[44]  Wanda J. Orlikowski,et al.  Learning from Notes: organizational issues in groupware implementation , 1992, CSCW '92.

[45]  Robin Jeffries,et al.  Information artisans: patterns of result sharing by information searchers , 1993, COCS '93.

[46]  Samuel Kaski,et al.  Self organization of a massive document collection , 2000, IEEE Trans. Neural Networks Learn. Syst..

[47]  Bradley N. Miller,et al.  GroupLens: applying collaborative filtering to Usenet news , 1997, CACM.

[48]  Marti A. Hearst,et al.  Reexamining the cluster hypothesis: scatter/gather on retrieval results , 1996, SIGIR '96.

[49]  Oren Etzioni,et al.  Grouper: A Dynamic Clustering Interface to Web Search Results , 1999, Comput. Networks.

[50]  Tim Finin,et al.  A Language and Protocol to Support Intelligent Agent Interoperability , 1992 .

[51]  Donna K. Harman,et al.  Overview of the Sixth Text REtrieval Conference (TREC-6) , 1997, Inf. Process. Manag..